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\  FOREWORD 


We  present  in  this  volume  the  papers  given  at  the  fourth  annual 
Clinic  on  Library  Applications  of  Data  Processing,  held  at  Urbana, 
Illinois,  from  April  24-27,  1966,  under  the  sponsorship  of  the  Uni- 
versity of  Illinois  Graduate  School  of  Library  Science. 

"The  Use  of  a  Remote  Console  in  Programmed  Teaching  by 
Computer"  is  the  title  of  the  speech  and  the  demonstration  presented 
by  Professor  Donald  L.  Bitzer  at  the  Clinic.    As  this  speech  was  not 
recorded,  we  are  reprinting,  with  the  permission  of  the  publisher,  an 
article  entitled  "The  Uses  of  PLATO,  a  Computer  Controlled  Teach- 
ing System,"  which  appeared  in  the  January  1966  issue  of  Audio  Visu- 
al Instruction.    This  article  by  Professors    Bitzer,  Lyman,  and  Easley 
contains  in  essence  the  material  presented  by  Donald  Bitzer  at  the 
Clinic. 

Not  included  in  this  volume  are  the  remarks  by  Neil  McMullan, 
Robert  Trocchi,  and  Philip  St.  Aubin  of  the  Minneapolis -Honey  well 
Company;  they  described  the  features  and  library  uses  of  Honeywell 
computers,  especially  the  200  series.    Also  omitted  from  this  publi- 
cation are  the  discussions  which  followed  each  paper  and  the  content 
of  the  problem  session. 

It  seems  clear  that  data  processing  has  already  affected  many 
libraries  and  is  likely  to  affect  many  more  in  the  near  future.    The 
university  and  special  libraries  have  been  among  the  first  to  become 
involved,  and  usually  at  the  computer  level;  school  and  public  li- 
braries are  beginning  to  be  involved,  and  usually  at  the  level  of 
punched  cards.    But  involvement  is  only  a  first  step  along  the  road 
to  successful  operation  of  effective  systems  of  data  processing;  to 
travel  this  road,  librarians  need  to  learn  many  new  things. 

In  this  process  of  successful  application  to  libraries  of  data 
processing,  the  University  of  Illinois  Graduate  School  of  Library 
Science  seeks  to  make  a  contribution  by  way  of  this  annual  conference 
or  Clinic.    The  focus  of  the  meeting  is  on  the  reporting  of  actual  ex- 
perience in  case  history  form.    We  seek  promising  and  fruitful  ex- 
perience (e.g.,  PLATO's  use  of  a  console -computer  hook-up)  and  not 
necessarily  successful  experience  alone;  for  surely  we  can  learn 
much  from  mistakes  and  failures  too.    Libraries  which  have  had  ex- 
perience with  computers,  which  should  be  shared  with  others  in  the 
profession,  are  invited  to  communicate  with  the  School  for  a  possible 
place  in  future  Clinics.    Besides  the  emphasis  on  case  reports,  the 


Clinic  pattern  includes  each  year  a  presentation  by  a  manufacturer 
of  data  processing  equipment,  a  paper  on  a  functional  technique  (in 
this  volume,  it  is  Jean  Perreault's  paper  "Approaches  to  Library 
Filing  by  Computer")  and  a  general  paper.    This  pattern  seems  to 
have  been  appropriate  for  the  registrants  at  the  Clinics  so  far  to 
judge  from  their  remarks  and  from  the  capacity  audience  we  have 
usually  had. 

The  planning  committee  for  the  1966  Clinic  consisted  of  Mrs. 
Frances  B.  Jenkins  and  Mr.  Dewey  E.  Carroll  of  the  Library  School 
faculty.    I  wish  to  thank  them  and  our  other  colleagues  for  their  active 
assistance  and  participation  in  the  Clinic.    The  speakers,  of  course, 
were  the  major  contributors  to  whatever  success  the  Clinic  had,  and 
all  of  us  here  appreciate  their  effort.    The  registrants  too  helped  in 
the  educative  process,  and  their  presence  and  cooperation  were  our 
reward. 

Herbert  Goldhor 
Urbana,  Illinois 
November,  1966 
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CURRENT  TRENDS  IN  LIBRARY  AUTOMATION 


Joseph  Becker 


The  punched  card  is  a  remarkable  invention  which  has  evolved 
into  a  multi -billion  dollar  industry  and  has  made  its  impact  felt  on 
almost  every  aspect  of  our  society.    The  genesis  of  the  punched  card 
can  be  traced  to  the  man  credited  with  inventing  it,  Herman  Hollerith. 
In  the  late  1800' s,  Hollerith  cut  a  card  to  the  exact  dimensions  of  the 
American  dollar  bill  and  devised  a  method  for  representing  a  number 
or  a  letter  in  the  identical  place  on  each  card  every  time.    Although 
the  dollar  bill  has  shrunk,  the  Hollerith  punched  card,  after  a  hundred 
years,  has  not  changed  by  a  millimeter !    Because  paper  is  a  non- 
conducting material,  it  is  possible  to  perform  counting  operations  by 
passing  electric  current  through  the  holes  in  the  card.    It  was  this 
simple  idea  which  helped  the  United  States  analyze  statistics  collected 
by  the  1890  census  and  which  later  led  to  many  other  applications, 
including  some  of  interest  to  libraries. 

Herman  Hollerith's  biography  in  the  Dictionary  of  American 
Biography^  relates  that  the  punched  card  idea  was  suggested  to  him 
by  a  librarian.    Hollerith  thus  reports  the  incident  in  one  of  his  let- 
ters:   "One  evening  at  Dr.  B's  tea  table  he  said  to  me,  'There  ought 
to  be  a  machine  for  doing  the  purely  mechanical  work  of  tabulating 
population  and  similar  statistics.'"    The  "Dr.  B"  whom  Hollerith  re- 
fers to  was  Dr.  John  Shaw  Billings,  who  was  then  Librarian  of  the 
Army  Surgeon  General's  Library  and  who  was  destined  to  become  the 
first  Director  of  the  New  York  Public  Library.    To  this  chance  re- 
mark, Hollerith  attributes  his  inspiration  for  the  development  of  the 
punched  card.    Since  it  was  a  librarian  who  started  it  all,  is  it  any 
wonder  that  Dr.  Billings'  professional  descendants  should  wish  to 
emulate  his  foresight  by  considering  possible  uses  of  data  processing 
in  libraries? 

Over  the  years,  punched  cards  were  gradually  applied  to  diverse 
areas  of  business,  and  by  1930  even  universities  were  employing  ma- 
chines to  perform  functions  associated  with  accounting  and  enrollment. 


Joseph  Becker  is  a  Lecturer  in  the  School  of  Library  Science  of 
Catholic  University  of  America,  Washington,  D.  C. 


It  was  just  about  at  this  time  that  Ralph  Parker,  then  a  junior  Li- 
brarian at  the  University  of  Texas,  conceived  of  the  idea  of  using 
punched  card  equipment  for  circulation  work.    The  Director  of  the 
University  of  Texas  Library  was  Don  Coney,  and  Parker  recalls, 
with  good  humor,  how  after  many  months  of  persuasion  Coney  finally 
gave  him  a  $300  grant  for  experimentation— but  only  after  cautioning 
him  to  spend  the  money  wisely!    Another  milestone  in  library  punched 
card  history  was  passed  in  the  following  decade  when  Marjorie  Quig- 
ley,  Librarian  of  the  Montclair  Public  Library  in  New  Jersey,  ac- 
quired special-purpose  equipment  for  controlling  book  transactions. 
This  system  of  circulation  control  was  the  first  to  adopt  the  method 
of  joining  a  machine -readable  book  card  and  a  machine -readable 
borrower's  card  in  a  single  master  record  at  borrowing  time.    This 
twenty-five-year-old  pilot  punched  card  installation  was  the  fore- 
runner of  the  IBM  357  Data  Collection  System  used  by  some  libraries 
today  for  computerized  circulation  work. 

The  last  twenty  years  have  witnessed  increasing  interest  among 
librarians  concerning  the  possibility  of  using  punched  card  machines 
and,  more  recently,  computers,  to  carry  out  many  other  library 
functions.    The  reasons  are  clear.    First,  the  rate  of  publishing  has 
climbed  steadily,  dramatically  increasing  the  number  of  printed 
pieces  to  be  acquired,  processed,  housed,  and  circulated  by  libraries. 
Secondly,  a  rapidly  expanding  and  more  literate  population  has  gener- 
ated demands  for  reader  services  that  have  far  exceeded  a  library's 
ability  to  respond  effectively  with  traditional  methods  and  techniques. 
Prospects  for  the  future  are  even  more  staggering.    Hence  the  pro- 
fessional librarian  has  been  prompted  to  look  for  help  to  the  new 
technology  available  in  modern  data  processing  equipment  and 
systems. 

Before  the  digital  computer  can  be  put  to  work  constructively  in 
libraries,  its  power  and  limitations  must  first  be  understood  by  the 
professional  librarian.    Data  Processing  Clinics,  such  as  this  one  at 
the  University  of  Illinois,  and  data  processing  courses,  which  are  be- 
ginning to  appear  in  library  schools  throughout  the  country,  provide 
excellent  opportunities  for  learning. 

Computers,  in  the  present  day  sense,  first  became  known  during 
the  late  1940' s.    Ungainly,  30-ton,  vacuum-tube  heavyweights,  they 
required  their  own  power  generators  and  air  conditioning  systems. 
Although  their  reliability  left  much  to  be  desired,  these  first- 
generation  computers  were  able  to  perform  mathematical  operations 
in  a  fraction  of  a  second— a  speed  several  orders  of  magnitude  greater 
than  was  possible  with  the  manual  processing  previously  available. 
A  second  generation  of  transistorized  computers  became  available  in 
the  late  1950's.    Most  of  the  machines  we  see  around  us  today  are  the 
results  of  this  metamorphosis.    They  are  faster,  cooler,  and  more 
accurate,  and  they  possess  versatility  and  personality  far  superior  to 


those  of  their  predecessors.    Furthermore,  these  machines  no  longer 
merely  compute;  they  are  capable  of  manipulating  with  equal  skill 
letters  of  the  alphabet,  words,  and  sentences. 

Many  and  varied  computer  programs  have  been  written  for 
second-generation  machines  to  solve  three  classes  of  library  prob- 
lems.   The  first  has  to  do  with  the  use  of  computers  for  supporting 
the  clerical  functions  found  in  technical  processing  and  circulation 
work.    Several  libraries  have  had  programs  written  which  cause 
computers  automatically  to  perform  certain  routine  work,  such  as 
interfiling  entries  in  a  catalog,  ordering  books  from  publishers,  writ- 
ing request  letters  to  the  Library  of  Congress  for  cards,  preparing 
serial  records  lists,  monitoring  circulation  operations,  printing  book 
catalogs,  and  analyzing  service  to  readers,  among  many  other  tasks. 
These  applications,  as  in  business,  are  designed  to  reduce  the  cleri- 
cal burden,  while  at  the  same  time  increasing  an  organization's  ability 
to  perform  more  work. 

A  second  category  of  computer  programs  that  concern  the  li- 
brarian is  the  field  of  information  storage  and  retrieval.    Here  the 
objective  is  to  develop  new  intellectual  methods  for  automatically 
extracting  meaning  from  a  text.    If  the  text  in  question  happens  to  be 
catalog  data,  the  problem  is  simplified  because  the  elements  of  bibli- 
ographic information  are  fixed  and  easily  identifiable.    However,  if 
the  purpose  is  to  correlate  facts  or  infer  subject  relationships  from 
the  complete  content  of  articles  and  books,  then  the  problem  becomes 
much  more  complex.    The  latter  area  will  require  additional  research 
before  we  can  see  clearly  how  such  a  capability  will  affect  the  duties 
of  the  reference  librarian. 

Until  recently  librarians  paid  scant  attention  to  the  application 
of  mathematics  and  computers  to  the  decision-making  process  in  li- 
brary management.    This  third  area,  referred  to  as  operations  re- 
search, is  one  which  employs  the  principles  of  scientific  management, 
but  whereas  scientific  management  is  concerned  with  efficiency,  oper- 
ations research  is  concerned  with  effectiveness.    At  the  turn  of  the 
century,  the  founder  of  scientific  management,  Dr.  F.  W.  Taylor, 
advocated  three  things:   (1)  an  inquiring  frame  of  mind,  which  refuses 
to  accept  past  practices  as  necessarily  correct,  (2)  the  replacement 
of  rules  of  thumb  by  more  carefully  thought  out  guides  to  action,  and 
(3)  the  collection  of  data  to  support  decisions  rather  than  reliance  on 
casual  judgments.    Operations  research  draws  on  these  principles 
and  extends  their  power  by  engaging  the  computer's  help.    Mathemati- 
cal models  are  used  to  characterize  a  process,  object,  or  concept  in 
precise  mathematical  terms.    When  models  are  incorporated  into  a 
computer  program,  it  becomes  possible  to  test  new  variables  in  order 
to  ascertain  how  the  process,  object,  or  concept  will  behave  under 
different  conditions.    A  library  administrator,  for  example,  can  thus 
learn  how  to  avoid  bad  surprises  by  evaluating  a  course  of  action  in 


advance  of  its  actual  implementation.  Johns  Hopkins  University  and 
Purdue  University  are  two  institutions  engaged  in  the  application  of 
operations  research  techniques  to  library  management  problems. 

The  computer  industry,  being  more  prolific  than  almost  any 
other,  announced  only  last  year  a  third  generation  of  computer  hard- 
ware.   Besides  offering  more  computer  power  for  the  dollar,  the  new 
equipment  has  several  engineering  advantages.    Its  components  are 
microminiaturized;  many  circuits  are  no  bigger  than  a  thumbtack. 
Its  memory  is  large  and  can  operate  at  speeds  measured  in  billionths 
of  seconds.    To  describe  a  billionth  of  a  second,  computer  people 
have  coined  the  word  "nanosecond."    A  nanosecond  is  a  very  thin 
sliver  of  time  indeed,  if  we  consider  that  it  is  to  a  second  what  a 
second  is  to  thirty  years!    The  ability  to  process  information  at  such 
lightning  speed  challenges  man  to  be  clever  in  the  ways  he  chooses  to 
use  such  power. 

Another  advantage  of  third -gene  ration  equipment  is  system 
versatility.    It  is  possible  to  connect  many  different  devices  to  the 
computer  over  standard  communication  links.    With  the  availability 
of  high -capacity,  direct-access  storage  devices  such  as  the  Data  Cell 
recently  announced  by  IBM,  we  can  now  think  in  terms  of  storing 
large  quantities  of  information  at  a  central  location,  yet  having  the 
data  available  to  numerous  users  at  distant  locations.    This  is  what 
is  meant  by  being  "on-line"  with  a  central  computer  and  "time- 
sharing" its  use. 

The  library  network  is  the  natural  extension  of  the  time -sharing 
idea,  and  automation  planning  at  the  Library  of  Congress  best  exem- 
plifies the  concept.    It  is  based  on  the  assumption  that  third -generation 
computers,  when  combined  with  communications  equipment,  will  per- 
mit local  and  regional  libraries  throughout  the  country  to  have  direct 
access  to  electronic  stores  of  information  in  Washington. 

The  notion  of  direct  dialing  to  a  digital  computer  has  already 
been  tested  at  several  experimental  locations  around  the  country. 
Elsewhere  in  this  publication,  M.  M.  Kessler  of  the  M.I.T.  Libraries 
describes  the  system  he  devised  to  link  100  users,  over  ordinary 
telephone  lines,  to  an  electronic  store  of  information  maintained  by 
a  central  computer  in  Cambridge,  Massachusetts.    His  interesting 
work  is  a  microcosm  of  the  kind  of  library  network  that  may  be  de- 
veloped in  the  future  on  a  much  greater  scale. 

The  fact  that  there  are  no  geographic  limits  to  the  library  net- 
work idea  is  particularly  attractive.    Last  summer,  at  an  ASLIB 
Conference  held  at  Ashridge  College  outside  London,  W.  J.  Bray, 
Deputy  Director  of  Research  of  the  British  Government  Post  Office, 
described  the  gradual  evolution  of  synchronous  satellites  and  the  ef- 
fect they  will  ultimately  have  on  world -wide  communications.    He 
traced  the  early  history  of  telecommunications  up  to  the  most  recent 
phase  of  laying  cables  by  submarine  across  the  Atlantic  and  Pacific 


Oceans.    By  way  of  contrast,  he  then  described  how  three  synchronous 
communication  satellites  in  circular  equatorial  orbit  around  the  earth 
can  reach  any  point  on  the  globe,  transmitting  and  receiving  not  only 
voice  and  video  traffic  but  also  the  digital  language  of  the  computer. 
He  predicted  that  communication  satellites,  such  as  EARLY  BIRD, 
will  permit  intercontinental  computer-to-computer  "conversations," 
which  eventually  will  facilitate  the  sharing  of  information  resources 
among  the  libraries  of  the  world. 

The  idea  of  creating  a  national  library  network  strikes  me  as 
the  most  exciting  development  that  data  processing  has  thus  far  of- 
fered us.    A  library  network  is  rich  with  the  promise  of  a  wholly  new 
approach  to  the  problem  of  gathering  and  retrieving  essential  infor- 
mation.   It  will  transform  libraries  into  active,  rather  than  passive, 
sources  of  knowledge  by  permitting  information  exchange  to  flow  in 
either  direction  between  library  and  patron.    The  ability  to  confer 
with  a  library  without  necessarily  visiting  it,  and  the  added  ability  to 
transfer  selected  information  from  library  to  home  or  office  for  indi- 
vidual use,  should  have  a  profound  effect  on  the  processes  of  research 
and  education.    If  we,  as  professional  librarians,  are  looking  for  an 
intellectual  challenge,  then  this  is  it. 
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THE  TECHNICAL  INFORMATION  PROJECT  OF  THE 
MASSACHUSETTS  INSTITUTE  OF  TECHNOLOGY* 


Project  MAC  at  MIT  has  been  mentioned  as  one  of  the  newer 
developments  in  the  computer  art.    In  Project  MAC,  a  man  in  Michi- 
gan can  sit  at  a  teletype  machine  and  interrogate  a  computer,  or  in 
general  behave  as  if  the  computer  were  next  door  to  him.    He  can 
program,  compute,  or  do  what  programmers  call  debugging  or  clean- 
ing up  a  program.    He  can  also,  as  of  a  few  months  ago,  type  a  re- 
quest such  as  "Compile  a  recent  bibliography  on  laser  physics"  or 
"What's  new  in  plasma  physics?"    It  is  this  application  of  Project 
MAC  that  will  be  described  here. 

When  the  man  in  Michigan  formulates  his  question,  when  he 
says  "I  want  the  latest  bibliography  on  lasers,"  he  does  not  get  just  a 
printout  of  a  previously  compiled  bibliography.    He  gets  the  results 
of  the  actual  putting  together  of  a  bibliography  based  on  thousands 
and  tens  of  thousands  of  documents  in  the  computer's  store,  since  it 
is  a  dynamic  system.    This  then  is  a  function  that  has  been  grafted 
on  the  MAC  computer  system  at  MIT,  and  it  is  necessary  to  under- 
stand the  technological  environment  of  Project  MAC. 

Project  MAC  was  originally  a  very  ambitious  notion;  some  of 
the  more  venturesome  people  at  MIT  thought  of  it  as  "Machine  Aided 
Cognition,"  which  is  a  very  revolutionary  notion,  but  it  is  more  re- 
alistically described  as  "Multiple  Access  Computer."    TIP,  the  other 
acronym  which  will  be  used  here,  stands  for  Technical  Information 
Program;  it  is  that  part  of  MAC  which  is  concerned  with  non- 
computational  uses  of  the  computer,  particularly  for  the  manipulation, 
storage,  and  retrieval  of  all  manner  of  technical  information. 


The  following  paper  is  based  on  the  tape  of  an  unscripted  talk 
given  by  M.  M.  Kessler  (Associate  Director  of  Libraries,  and  Direc- 
tor of  the  Technical  Information  Project  at  the  Massachusetts  Institute 
of  Technology).    Mr.  Kessler  has  kindly  granted  permission  for  the 
paper  to  be  printed  in  the  version  below,  with  the  proviso  that  it  repre- 
sents the  editor's  understanding  of  his  remarks;  neither  Mr.  Kessler 
nor  M.I.T.  is  to  be  held  responsible  for  the  precise  form  in  which 
they  appear  here. 
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The  essential  idea  of  MAC  is  that  a  computer  will  be  utilized 
as  is  a  public  utility.    Just  as  there  is  not  a  little  dynamo  for  each 
room  or  each  building,  so  a  computer  for  each  department  or  each 
project  is  not  needed.    One  central  computer  can  serve  the  entire 
community.    Now  whether  that  community  is  the  nation,  or  a  region, 
or  a  university  is  not  quite  clear.    It  most  certainly  is  not  the  nation, 
but  it  most  certainly  is  not  just  one  company  either.    At  this  time  we 
might  think  of  a  "public  utility"  computer  as  being  of  the  magnitude 
that  might  very  well  serve  a  region.    Now  obviously,  in  order  for  that 
to  be  realized  there  must  be  a  remote  access  facility.    And  so  the 
remote  access  idea  is  very  much  a  part  of  the  MAC  thinking,  and  it 
means  that  you  can  approach  the  computer  and  use  it,  without  any 
limitations,  from  a  remote  location.    As  of  this  time  there  are  two 
possibilities.    There  is  the  telephone  wire  approach  which  is  cheap 
and  ubiquitous;  all  you  have  to  do  is  attach  a  proper  piece  of  equip- 
ment at  the  end  of  a  telephone,  which  at  this  time  is  a  standard  tele- 
type machine.    A  more  sophisticated  approach  is  to  use  a  more  broad 
band  type  of  communication,  such  as  television  and  data  transfer  cir- 
cuits; but  for  any  sort  of  human  interaction  with  the  computer,  the 
telephone  system  is  fast  enough,  accurate  enough,  and  available 
enough  so  that  we  need  think  only  in  terms  of  telephone  connections 
to  the  computer.    And  so  the  approach  in  and  out  of  the  computer,  or 
the  input  -  output  machinery  so  to  speak,  is  a  standard  teletype  ma- 
chine connected  to  the  computer  by  means  of  telephone  wire.    Obvi- 
ously remote  access  is  a  very  important  aspect  of  MAC. 

The  other  important  aspect  of  the  public  utility  notion  of  a  com- 
puter is  time -sharing.    What  it  means  essentially  is  that  people  do 
not  line  up  serially  to  use  the  computer,  so  that  the  tenth  person  has 
to  wait  until  the  ninth  is  finished,  and  the  ninth  has  to  wait  until  the 
eighth  is  finished,  and  so  on.    Instead  there  is  a  long  trough,  and  they 
are  all  using  it  at  the  same  time.    MAC  can  accommodate  thirty  users 
at  the  same  time;  it  can  be  time -shared  by  thirty  people.    Strictly 
speaking  they  do  not  time -share  it,  and  they  do  use  it  in  rotation. 
However  the  rotation  is  so  fast  that  it  is  a  matter  of  seconds,  and  the 
printout  absorbs  the  cycle  time  because  the  computer  works  much 
faster  than  either  a  man  can  type  or  the  teletype  can  type  back  to  him. 
This  means  that,  while  one  person  is  typing,  if  he  hesitates  at  all  or 
if  his  attention  wanders,  somebody  else  is  using  the  computer.    The 
internal  machinery  is  such  that  the  computer  remembers  the  state  of 
the  last  operation,  and  when  it  comes  back  to  that  operation,  it  picks 
up  and  continues  another  few  milliseconds,  and  so  on.    It  is  not  liter- 
ally simultaneous,  but  virtually  so.    There  are  plans  to  increase  this 
facility  from  thirty  time -shared  users  to  possibly  one  hundred  and 
fifty. 

Besides  the  notion  of  it  being  remote  and  of  it  being  time- 
shared,  there  is  the  concept  of  being  on  line.    And  by  "on  line"  is 


meant  that  a  dialogue  can  take  place  between  the  person  and  the  ma- 
chine.   That  is,  you  do  not  bring  in  the  work,  give  it  to  an  operator, 
and  then  come  back  the  next  day  to  find  that  one  of  your  cards  was 
misplaced.    When  you  are  on  line,  if  you  make  a  mistake  the  com- 
puter immediately  tells  you  that  it  is  a  mistake,  and  if  you  want  to 
correct  something  you  can  do  it  immediately.    There  is  a  back  and 
forth  real  time  interaction  between  the  person  and  the  computer. 

A  subtle  psychological  factor  is  involved  here.    When  you  work 
with  a  batch  processing  system  that  is  not  on  line,  and  you  bring 
some  work  in  and  are  told  to  come  back  the  next  day,  and  you  come 
back  then  and  it  is  ready,  you  are  very  happy  indeed  that  it  is  ready 
twenty-four  hours  later.    But  if  you  sit  at  a  time-shared  console  and 
the  answer  comes  back  two  minutes  later  instead  of  two  or  three 
seconds  later,  there  is  a  great  deal  of  impatience  and  annoyance. 
You  don't  mind  waiting  twenty-four  hours,  but  you  do  mind  waiting 
two  minutes  instead  of  two  seconds.    This  is  not  entirely  irrational 
because  if  you  know  that  the  answer  is  coming  back  in  twenty-four 
hours  you  can  do  other  things,  but  if  you  expect  it  in  two  seconds,  and 
it  comes  back  in  two  minutes,  you  just  sit  there  and  stew.    This  is  an 
interesting  human  response,  and  there  is  apparently  a  critical  time 
delay. 

There  are  many  other  such  things  involved,  of  which  this  is  an 
example,  because  we  are  dealing  here  perhaps  for  the  first  time  with 
a  real  interactive  process  between  man  and  an  intelligent-like  ma- 
chine.   At  least  it  is  an  interaction  between  a  man  and  a  machine  by 
way  of  man's  intelligence.    You  are  sometimes  tempted  to  kick  at 
your  car  or  radio  and  take  an  anthropomorphic  attitude  towards  them. 
This  response  is  even  more  understandable  where  the  machinery 
simulates  some  intelligence,  and  you  expect  more  from  it  than  it  ac- 
tually gives  you;  the  psychological  strain  can  sometimes  be  consider- 
able when  you  realize  that  it  is  a  poor  dumb  beast,  and  not  even  that 
but  a  dumb  machine.    This  aspect  of  engineering  or  of  research  is 
extremely  important  in  our  type  of  application  because  we  are  think- 
ing of  using  the  computer  in  an  intellectual  manipulation  and  we  really 
do  not  know  what  the  answer  to  the  problem  will  be.    It  looks  pretty 
good  if  the  machine  can  be  made  much  bigger,  much  faster,  and  much 
cheaper.    This  is  very  important  because  if  the  machine  is  not  cheap, 
you  resent  its  not  only  wasting  your  time  but  wasting  $5  every  minute 
or  so,  and  that  gets  to  be  a  very  serious  resentment.    These  are  very 
important  considerations,  and  they  do  color  the  use  of  the  machine 
and  its  application  in  this  area. 

Fortunately,  the  computer  people  tell  us  that  computer  tech- 
nology can  only  go  one  way;  it  will  all  become  faster,  cheaper,  and 
smaller.    It  does  seem  that  things  are  moving  in  this  direction,  so 
that  we  can  look  forward  to  these  problems  being  solved.    However 
from  the  beginning  of  computer  technology  (which  is  not  more  than 
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ten  to  fifteen  years  old)  people  have  expected  great  things  from  the 
computer  in  the  way  of  library  and  information  application.    As  of 
today,  if  we  froze  the  technology  and  took  a  snapshot  of  the  situation, 
we  must  admit  that  the  application  or  the  contribution  of  computers 
to  our  type  of  problem  has  been  rather  trivial.    Here  and  there  a  list 
is  made  up,  or  a  little  bookkeeping  is  done,  or  a  circulation  system 
is  handled,  but  on  the  whole  the  computer  has  not  yet  made  its  impact 
on  library  and  information  sciences.    However,  the  inception  of  the 
computer  industry  was  only  about  twelve  or  fifteen  years  ago  at  the 
most,  and  the  next  ten  or  twelve  years  will  really  show  a  difference. 

To  proceed  with  the  description  of  the  MAC  system,  there  is  a 
computer,  and  the  computer  consists  of  two  parts,  the  processing 
unit  (with  circuitry  to  do  all  the  various  computer  manipulations),  and 
the  memory  unit  where  all  manner  of  information  is  stored.    In  the 
MAC  computer,  the  memory  is  divided  into  several  parts.    There  is 
first  of  all  a  part  of  the  memory  that  has  to  do  with  the  internal  work- 
ings of  the  computer,  e.g.,  the  addition,  subtraction,  or  whatever  else 
you  want  to  do  with  it;  this  concerns  only  the  computer  engineers,  the 
maintenance  people,  and  those  whose  responsibility  it  is  to  improve 
and  develop  the  system.    This  is  not  visible  to  anybody;  as  far  as  the 
user  is  concerned,  it  is  a  black  box  that  is  closed,  and  he  need  not 
know  what  is  inside.    There  is  another  part  of  the  computer  memory 
that  contains  within  it  various  facilities  which  are  available  to  the 
user.    These  involve  the  various  computer  languages,  such  as  For- 
tran, MAD,  or  whichever  it  is,  and  this  part  may  contain  worked-out 
programs,  so  that  if  you  want  to  perform  a  given  operation  you  do  not 
have  to  develop  the  program,  you  just  call  for  it.    This  then  is  a  li- 
brary of  publicly  available  routines,  programs,  and  data. 

The  rest  of  the  memory  is  divided  into  many  compartments. 
Each  compartment  is  accessible  to  only  one  user  or  subscriber,  and 
out  of  each  of  these,  schematically  and  not  to  dwell  on  the  actual  cir- 
cuitry, come  one  hundred  and  fifty  telephone  connections,  each  going 
to  a  console,  to  a  teletype  machine.    Thus  there  are  one  hundred  and 
fifty  locations  where  teletype  machines  are  installed,  and  that  location 
in  Michigan  is  one  of  them.    There  are  in  fact  about  twenty  or  so 
scattered  over  the  country;  the  rest  are  around  MIT.    Each  user  has 
at  his  disposal  a  common  library  of  programs,  languages,  facilities, 
and  so  on.    He  also  has  at  his  disposal  a  private  little  library,  his 
own  office  library,  which  he  can  put  things  into,  take  things  out  of, 
work  on,  and  do  all  kinds  of  things  that  are  available  to  him  only. 
Within  this  range  of  memory,  the  strictest  privacy  is  maintained. 
Great  care  has  been  taken  so  that  user  A  can  in  no  way  get  to  user 
B's  memory  unless  user  B  makes  that  facility  available. 

It  is  possible  for  any  one  of  the  users  to  take  part  of  his  private 
memory  and  declare  it  public,  and  that  in  every  sense  is  a  publication. 
As  a  matter  of  fact  we  have  an  editorial  board  and  referees,  and  we 
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go  through  the  entire  procedure  of  publication  before  any  user's  pri- 
vate file  is  made  available  to  others  as  a  public  facility.    To  give  a 
routine  example,  suppose  that  somebody  develops  a  new  way  of  inte- 
grating an  equation  or  a  new  way  of  solving  a  problem,  and  he  thinks 
that  it  may  be  of  use  to  others;  he  then  submits  it  to  the  editorial 
board  who  examines  it  and  approves  of  it  for  the  public  file.    As  soon 
as  that  is  done,  it  is  available  to  the  entire  community  of  users. 

It  is  with  regard  to  this  sort  of  scheme  that  we  have  developed 
TIP  which  stands  for  Technical  Information  Program.    A  rather  large 
slice  of  machine  memory  has  been  used  for  quite  some  time  as  our 
own  private  experimental  slice,  but  about  ten  months  ago  it  became 
sufficiently  developed  to  be  offered  to  the  public.    What  used  to  be  a 
private  memory  slice  has  now  joined  the  public  domain,  and  TIP  is 
now  available  to  all  of  the  150  consoles  that  have  access  to  MAC. 
There  are  150  consoles,  but  only  30  can  use  the  computer  simultane- 
ously.   In  fact,  since  more  than  one  person  has  access  to  any  one  of 
the  150  machines,  there  are  some  500  people  who  at  one  time  or  an- 
other use  Project  MAC. 

So  much  for  the  computer  structure  with  which  we  work.    Let 
us  now  look  at  TIP  in  more  detail.    When  we  first  started  thinking 
about  this  application,  it  was  clear  that  we  could  not  build  an  entire 
Library  of  Congress  or  MIT  library  system  into  a  computer,  turn  the 
switch  on,  and  then  have  it  work.    We  had  to  have  a  model  of  the  sys- 
tem, and  the  model  had  to  be  realistic  in  the  following  sense.    First 
of  all,  it  had  to  be  scalable,  so  that  if  it  worked,  it  could  be  scaled  by 
a  factor  of  10  or  100;  otherwise  it  would  be  a  toy  and  not  a  model. 
The  other  requirement  was  that  the  model  be  big  enough  to  be  capable 
of  functioning  in  a  real  environment  and  not  be  just  an  analytic  type 
of  model.    If  it  was  to  be  a  model  library,  it  had  to  be  a  big  enough 
library  so  that  people  who  want  to  use  libraries  would  actually  be 
motivated  to  come  to  this  thing  and  use  it.    Even  though  they  do  not 
care  about  this  as  an  area  of  research,  they  should  care  about  this 
as  a  service.    In  other  words  the  model  had  to  have  critical  size.    It 
had  to  be  small  enough  so  as  not  to  tax  the  experimental  facilities  of 
the  situation,  and  yet  it  had  to  be  big  enough  not  to  be  a  toy  but  to  be 
of  serious  interest  to  workers. 

As  a  result,  we  had  to  limit  our  literature  or  holdings,  and  we 
picked  physics  as  an  area  in  which  to  work,  more  or  less  by  chance 
because  I  am  a  physicist;  but  it  was  a  happy  choice  because  physics 
is  a  very  well  disciplined  literature.    More  than  that,  we  picked  the 
journal  literature  of  physics— no  reports,  no  books,  only  physics 
journal  articles.    We  process  now  twenty-seven  journals,  roughly 
1200  articles  per  month.    This  corresponds  to  about  60  percent  of  the 
physics  literature  which  ends  up  in  Physics  Abstracts.    For  each 
article  in  each  journal  as  it  comes  in,  we  put  into  the  computer  mem- 
ory the  following  information:    what  we  call  the  identification,  that  is 
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the  journal,  volume,  and  page  of  the  article;  the  title;  the  name  of 
the  author;  the  author's  location  or  institutional  connection;  and 
whatever  citations  or  bibliographical  references  are  in  the  article 
either  as  footnotes  or  in  the  body  of  the  text.    There  is  no  indexing 
or  key  word  identification  of  any  sort,  and  this  is  a  calculated  risk. 
One  of  the  important  considerations  was  that  the  system  or  model  be 
scalable,  and  one  of  the  most  difficult  things  to  scale  upward  is  human 
intelligence.    We  wanted  to  see  how  far  we  could  go  with  an  input  that 
is  purely  clerical.    What  we  record  for  each  article  requires  no  judg- 
ment whatsoever,  no  assignment  of  key  words,  or  indexing  terms— it 
is  purely  clerical.    Indeed  this  work  is  done  by  a  girl  who  sits  at  the 
teletype  machine  (we  do  not  use  IBM  cards)  and  types  directly  onto 
the  computer  memory  the  indicated  information— identification,  title, 
author,  location,  and  bibliography. 

The  question  of  course  is  whether  this  is  sufficient  for  a  legiti- 
mate system.    We  went  through  a  long  series  of  experiments  and  we 
are  satisfied  now  that  it  is  sufficient,  at  least  to  begin  with,  and  so 
we  are  not  doing  more.    However  the  system  is  flexible,  so  that  if  at 
a  later  date  we  want  to  add  other  things,  anything  at  all,  we  can  do  it 
by  just  typing  in  the  identification  data.    The  system  is  open  ended, 
and  if  more  information  is  needed,  it  can  easily  be  put  in. 

This  data  then  is  on  the  computer  memory  disc  in  a  format  that 
is  immediately  available  to  any  one  of  the  150  users.    In  other  words, 
there  is  no  loading  of  tapes  or  loading  of  cards;  the  information  is 
there  24  hours  a  day,  except  during  the  times  when  the  computer  is 
off  the  air.    It  takes  a  bit  of  computer  memory  to  do  that  but  it  is  not 
prohibitive.    This  information  is  organized  on  the  memory  disc  much 
as  it  would  be  organized  on  a  library  shelf.    Think  of  twenty- seven 
journals,  bound  into  volumes,  and  located  all  in  one  place;  in  our  case 
we  have  files,  and  for  this  purpose  each  volume  is  a  file.    And  within 
the  file,  within  the  volume,  the  information  is  organized  by  way  of 
page  numbers.    This  is  the  most  primitive  approach  to  file  organiz- 
ation, and  it  is  certainly  not  the  optimal  approach.    It  is  now  in  the 
process  of  being  changed  to  a  more  reasonable  approach  from  the 
machine  point  of  view  because  it  is  not  the  best  approach  for  large 
scale  searches.    As  our  library  gets  larger,  we  will  change  this  file 
structure,  but  as  of  now  it  is  a  serial  file. 

We  then  had  to  develop  a  set  of  words,  a  search  language,  be- 
cause we  set  ourselves  the  design  criterion  that  the  people  who  want 
to  use  MAC  for  real  purposes  must  not  be  asked  to  do  programming. 
They  are  librarians,  writers,  working  scientists— including  physi- 
cists—but they  are  not  programmers,  and  they  are  not  to  be  asked  to 
write  their  own  programs.    We  had  to  develop  a  language  that  was 
close  enough  to  English  to  make  communication  with  the  computer 
comfortable. 
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The  language  we  have  developed  is  a  very  comfortable  sort  of 
semi-English.    What  you  do  is  to  sit  at  the  typewriter,  log  in,  and 
identify  yourself  so  that  the  MAC  system  accepts  you  as  a  legitimate 
user.    You  then  type  the  word  "TIP,"  which  informs  the  computer  that 
it  is  about  to  be  used  as  a  library  and  not  as  a  computer.    There  are 
many  other  facilities  which  the  computer  has.    For  example,  civil 
engineers  have  developed  a  road  intersection  program  and  bridge  net- 
work program,  and  there  are  all  kinds  of  biological  and  psychological 
programs;  somehow  the  computer  has  to  be  informed  of  which  aspect 
of  its  personality  is  about  to  be  called  into  play,  so  to  speak.    When 
you  type  "TIP,"  that  indicates  that  you  are  interested  in  the  library 
part  of  the  computer.    The  computer  will  come  back  and  probably  say 
"Ready,"  or  some  such  thing;  when  it  says  "Wait,"  something  is 
wrong.    When  it  is  ready,  you  might  then  say,  "Search  Physical  Re- 
view, Volume  136  to  140,"  which  is  a  typical  search  command.    Or 
you  might  say,  "Search  Physical  Review,  latest  issue,"  or  "Search  all 
latest,"  which  means  search  the  latest  issue  of  all  the  twenty- seven 
journals.    In  other  words,  there  is  quite  a  variety  of  research  state- 
ments you  can  make,  each  of  which  essentially  means,  "Take  this  de- 
signated literature  off  the  shelf." 

Then  you  type  some  request,  like  "Find  title  nuclear,"  which 
means  "Find  every  title  in  this  literature  range  that  has  the  word 
nuclear  in  it."    Here  again  there  is  a  wide  variety  of  possibilities. 
You  can  say   "Find  title  nuclear  energy,"  in  other  words  you  can  use 
phrases,  or  you  can  say  "Find  nuclear  energy,"  and  in  that  case  it 
will  find  every  title  which  has  the  words  "nuclear  energy"  regardless 
of  their  order,  for  example,  "Production  of  energy  in  nuclear  engines." 
If  you  want  the  words  "nuclear  energy"  to  be  exactly  as  stated,  you 
put  an  asterisk  between  them  and  this  means  that  they  must  be  joined. 
There  is  a  wide  variety  of  other  manipulations;  you  can  say  "and/or 
but  not,"  for  example,  "Find  title  nuclear  but  not  spectroscopy. "    In 
other  words,  you  want  not  the  whole  set  of  literature  having  to  do  with 
nuclear  spectroscopy,  but  you  do  want  other  nuclear  literature.    You 
can  say  "Find  nuclear  and  author  such  and  such;"  you  can  mix  any  of 
these  things  with  the  logical  possibilities  worked  in.    And  of  course 
you  can  find  any  one  of  these  things  separately.    You  can  say  "Find 
citation  (and  name  the  journal,  volume,  page),"  and  that  means  "Search 
this  literature  and  find  every  article  that  cites  this  paper."    Or  you 
can  say  "Find  author  Smith,"  or  any  combination  of  these  by  way  of 
"and/or  but  not,"  and  so  on. 

By  these  directions  you  take  the  volume  off  the  shelf,  and  look 
through  it.    Having  found  what  you  want,  say  all  the  articles  with  the 
word  "nuclear"  in  their  titles,  what  do  you  do  with  it?   We  have  to  make 
some  sort  of  output  statement,  and  a  common  form  of  output  is  print- 
ing.   So  you  say  "Print  title,  author,  and  page  no."    In  other  words, 
you  direct  the  computer,  having  found  all  the  papers  with  "nuclear" 
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in  the  title,  to  print  the  complete  title,  and  the  author,  and  whatever 
else  you  wish  to  ask  for.    You  can  also  say  "Save"  because  if  you 
print,  you  erase  what  you  have  found  as  soon  as  it  is  printed  out,  but 
you  may  want  to  save  the  information  for  later  work.    Suppose  you 
have  discovered  230  articles  with  the  word  "nuclear"  in  their  titles, 
you  have  created  a  new  file,  a  new  list.    You  have  only  to  name  it 
somehow  if  you  wish  to  save  it;  so  you  say   "Save  file,"  and  follow  it 
by  a  name.    For  example,  you  might  call  it  "Nuclear  titles"  or  "List 
1"  or  "Jones'  favorite  subject";  you  must  give  it  some  name  because 
you  may  want  to  come  back  to  it  later.    Once  you  have  done  that,  you 
can  at  another  time  say  "Search"  and  instead  of  searching  Physical 
Review,  you  can  now  search  the  file  that  you  have  made.    This  is  a 
very  important  consideration. 

The  main  point  is  that  in  this  type  of  organization  and  language 
structure  there  is  the  possibility  for  searching  by  author,  by  any 
word  in  the  title,  or  by  what  is  known  as  citation  index;  all  of  these 
are  available  at  any  time  and  in  many  mixtures.    You  could  do  one, 
then  the  other,  and  so  on  and  back  and  forth,  and  you  can  save  that 
information  and  then  look  at  it  again  later  from  another  point  of  view. 
There  are  several  safety  features  built  in.    For  example,  if  you  re- 
quest an  item  that  is  not  in  the  library,  e.g.,  the  Journal  of  Gestalt 
Psychology,  the  computer  will  say  "The  Journal  of  Gestalt  Psychology 
is  not  in  the  TIP  library."    If  you  make  mistakes  in  spelling  and  things 
of  that  sort,  there  are  provisions  for  erasing  and  correcting  them. 
If  as  you  work,  or  as  the  computer  works  back  at  you,  the  telephone 
rings  and  you  want  to  go  away  for  a  while,  you  can  stop  and  then  come 
back  and  type  "Start,"  and  it  will  start  again  where  it  left  off,  even  if 
it  is  three  hours  later.    We  now  have  a  graduate  student  working  on  a 
teaching  program,  so  that  the  computer  itself  will  teach  people  how 
to  use  the  computer.    This  is  of  particular  interest  in  a  case  where  a 
non-programming  population  is  involved.    That  is,  we  would  like  to 
provide  for  the  situation  in  which  a  user  logs  in  and  says  "I  want  to 
use  TIP."    He  should  know  that  much.    Then  the  computer  comes  back 
and  says,  "Do  you  know  how  to  do  it  or  do  you  not?"    If  he  says  "No," 
a  teaching  program  comes  up  point  by  point  with  illustrations.    This, 
of  course,  is  very  closely  related  to  the  whole  man-machine  problem. 

A  second  very  useful  search  technique  has  been  developed, 
something  that  we  call  bibliographic  coupling.    Let  us  say  there  are 
two  or  three  articles  which  I  know  I  am  interested  in.    They  may  be 
my  own  papers,  or  they  may  be  the  papers  of  a  friend  who  is  an  ex- 
pert in  lasers,  and  I  know  that  I  am  interested  in  his  work.    Now  I 
want  to  be  able  to  conduct  a  literature  search  and  find  others  like  it. 
This  can  be  done,  provided  that  you  identify  the  criterion  of  likeness. 
You  may  say  to  the  computer    "Find  others  like  it,"  and  by  that  you 
mean  find  other  papers  which  share  something  with  this  paper.    They 
may  share  a  variety  of  things;  they  may  share  common  words  in  the 
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title,  or  they  may  share  the  fact  that  they  were  all  produced  in  or 
came  from  the  same  laboratory.    These  are  useful  but  rather  limited 
applications.    What  we  found  to  be  extremely  useful  is  to  say  "Find 
other  papers  like  these"  where  the  criterion  of  likeness  is  the  bibli- 
ography, or  the  list  of  citations  in  a  given  paper.    Then  we  are  saying 
in  effect,  "Examine  these  two  papers  that  I  know  are  of  interest  to 
me;  observe  the  citations  in  these  papers,  then  examine  some  given 
range  of  literature  and  find  other  papers  that  have  similar  items  in 
their  citations."    We  call  this  "bibliographic  coupling,"  because  the 
resultant  papers  are  coupled  not  with  words,  but  by  virtue  of  sharing 
certain  references.    For  example,  if  I  write  a  paper  with  twenty  items 
in  my  bibliography,  and  somebody  else  in  Japan  writes  a  paper  with 
twenty  items  in  his  bibliography,  and  if  it  happens  that  ten  are  the 
same  in  both  papers;  then  the  probability  is  very  high  that  these 
papers  are  related.    We  have  done  many  hundreds  of  such  experi- 
ments and  find  this  to  be  an  extremely  powerful  search  tool.    So  the 
share  group  of  programs,  added  to  the  find  group  of  programs  de- 
scribed previously,  is  very  important.    We  can  say  "Share  title"  with 
the  named  paper,  or  "Share  author"  and  so  on;  but  of  all  those  shared 
programs,  the  shared  bibliography  is  the  most  potent  search  tool. 

In  the  hierarchy  of  search  procedures,  we  have  the  find  list  of 
programs  and  the  share  list  of  programs.    Beyond  that  we  have  an- 
other even  more  sophisticated  search  procedure.    (By  the  way,  the 
number  of  papers  on  disc  in  the  TIP  library  is  now  about  35,000  and 
growing  at  the  rate  of  1200  per  month;  this  is  pretty  close  to  four 
years  of  literature.)    Let  us  say  we  want  to  produce  a  bibliography  on 
lasers.    There  are  various  strategies  one  can  use.    You  can  say 
"Search  all,"  that  means  "Search  the  entire  library,"  or  you  can  say 
"Search  1965  papers"  or  anything  of  that  sort. 

Then  you  can  say  "Find  title  containing  the  word  laser,"  and  if 
you  put  a  plus  in  front  of  laser,  the  computer  will  find  anything  where 
the  suffix  is  laser,  e.g.,  super-laser  or  anything  of  that  sort;  and 
sometimes  it  is  convenient  to  do  this.    Then  for  output,  you  can  say 
"Print  and  save,"  and  call  the  saved  file  "Laser  titles."    In  a  typical 
case,  this  will  result  in  about  230  papers.    These  are  230  papers  out 
of  those  35,000  which  have  the  word  laser  in  the  title. 

This  by  no  means  exhausts  the  laser  literature  obviously,  be- 
cause there  are  many  papers  that  have  to  do  with  "lasers"  which  do 
not  have  the  word  laser  in  the  title.    But  certainly  those  that  do  have 
"laser"  in  the  title  are  important,  and  we  have  now  compiled  and  saved 
this  first  list  of  all  papers  that  have  the  word  "laser"  in  their  title. 
The  next  step  is  to  say  "Search  this  new  list  of  laser  titles";  we  do  not 
search  the  entire  literature  of  35,000  now,  we  search  only  230  papers. 
Actually  the  direction  is  "Read  laser  titles,"  by  which  is  meant  that 
the  computer  is  asked  to  list  all  the  references  in  these  papers  in  the 
order  of  their  frequency  of  occurrence;  out  of  230  papers,  there  will 
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be  perhaps  3000  references— slightly  more  than  10  per  paper.    Most 
of  them  of  course,  are  referred  to  once  and  never  again.    Some  are 
referred  to  twice,  and  so  on;  these  3000  references  then  will  be 
listed  in  the  order  of  their  frequency  of  appearance  in  this  group  of 
papers,  and  we  end  up  with  a  list  that  says  "  Paper  1  in  this  list  of 
references  appeared  17  times";  "Paper  2,  15  times,"  and  so  on.    To 
do  all  this  will  take  5  or  6  seconds. 

Of  course,  one  might  take  just  a  slice  of  this  and  ask  for  a  list 
of  all  papers  cited  more  than  five  times,  and  go  look  at  those.    But 
what  is  even  more  interesting  is  to  say  "Let  us  go  back  to  the  litera- 
ture, search  all  of  it  again,  and  find  all  of  those  papers  that  do  not 
have  "laser"  in  the  title,  but  nevertheless  cite  these  same  refer- 
ences."   For  example,  someone  may  have  used  a  synonym  for  laser, 
namely  optical  maser,  or  some  other  term.    This  second  complete 
search  then  gives  you  another  sample  of  relevant  articles.    You  look 
at  this  new  set,  and  ask,  "Here  is  a  set  of  papers  concerned  with 
lasers,  and  the  coupling  is  very  good,  although  they  do  not  have  the 
word  laser  in  their  titles.    What  are  the  most  frequent  words  in  their 
titles?"    In  other  words,  "What  are  the  most  frequent  and  common 
actual  substitutes  for  the  word  laser?"    Perhaps  you  come  out  with 
two  or  three  terms,  such  as  optical  maser  or  optical  amplifier;  then 
you  go  back  through  the  literature,  and  search  all  titles  for  those 
words.    This  goes  on  back  and  forth,  and  each  time  you  form  a  new 
list;  there  are  in  fact  list  merging  programs  that  say  "Go  back  to  the 
original  list,  and  if  this  new  article  is  not  there,  add  it;  if  it  is  a 
duplication,  do  not."    The  list  which  started  with  230  articles  may 
add  50  in  the  next  process,  and  30  in  the  next;  you  continue  searching 
as  long  as  you  wish,  until  you  no  longer  add  anything.    Then  you  must 
be  at  the  end. 

This  sort  of  procedure  lends  itself  to  many  different  kinds  of 
manipulation,  and  we  call  them  strategies.    And  this  particular  strate- 
gy just  described,  in  which  you  start  with  a  word  and  then  go  to  fre- 
quency distribution  of  references,  and  back  to  the  word  and  so  on,  is 
called  Strategy  A  (since  it  was  the  first  strategy  which  worked  well 
for  us).    You  can  sit  at  the  teletype  machine  now  and  say  "Execute 
Strategy  A  on  lasers;"  you  do  not  have  to  go  through  all  of  the  steps 
just  described,  and  the  computer  will  proceed  to  perform  each  step 
in  the  whole  process.    This  is  a  high  order  of  programming  which  is 
extremely  potent. 

The  system  is  now  in  use  by  something  like  75  people  per  week. 
In  other  words,  75  people  log  in  and  ask  for  TIP  to  become  available 
to  them.    We  have  written  a  monitor  system,  so  that  everybody  who 
is  using  the  system  gets  recorded  as  who  he  is,  how  long  he  is  using 
it,  what  questions  he  asks  and  so  on;  and  we  are  collecting  user  ex- 
perience now  through  this  monitor  program.    It  is  also  being  used  by 
librarians  in  a  much  more  self-conscious  way.    The  system  has  been 
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in  use  about  six  months  in  this  public  fashion;  we  have  had  it  in  use 
experimentally  for  over  a  year  now,  but  it  was  only  about  six  months 
ago  that  it  went  public,  so  to  speak. 


THE  JOHNS  HOPKINS  UNIVERSITY  LIBRARY 


Benjamin  Courtright 


The  purpose  of  this  paper  is  to  describe  two  projects  relevant 
to  the  interests  of  this  conference  which  are  active  at  the  Johns 
Hopkins  University.    These  are,  first,  the  conversion  of  the  shelf  list 
of  the  University  Library  to  machine -readable  form,  and,  second,  the 
design  and  operation  of  a  new  type  of  computer-based  circulation 
system. 

These  activities  were  initiated  and  are  being  carried  out  at 
Johns  Hopkins  as  part  of  an  operations  research  and  systems  engi- 
neering study  of  university  libraries,  under  the  sponsorship  of  the 
National  Science  Foundation.    The  leadership  in  initiating  and  direct- 
ing this  study  has  from  the  beginning  been  that  of  Robert  H.  Roy, 
Dean  of  the  School  of  Engineering  Science  and  Chairman  of  the  De- 
partment of  Operations  Research  of  the  University. 

The  Project  team  has  been  fortunate  throughout  in  having  a 
close  and  cordial  working  relationship  with  the  staff  of  the  Johns 
Hopkins  University  Library.    From  providing  a  field  for  examination 
and  case  study  of  systems  problems,  to  sympathetic  and  responsive 
reception  of  proposals  for  action,  the  Library  has  been  friend, 
teacher,  and  subject  all  at  once.    This  fortunate  and  indeed  essential 
state  of  affairs  has  been  due  above  all  to  the  perception,  the  vision, 
and  the  understanding  of  the  University  Librarian,  John  Berthel. 

The  support  of  the  National  Science  Foundation  has  been  crucial. 
The  cost  of  the  work  is  small  in  terms  of  the  long-range  objectives 
which  are  at  stake  here,  but  without  the  far-sighted  support  of  NSF 
these  undertakings  could  not  have  been  attempted  at  all. 


Conversion  of  the  Shelf  List  to  Magnetic  Tape 


It  is  hardly  necessary,  at  this  date,  to  dwell  on  the  desirability 
of  having  the  shelf  list  of  a  library  in  machine -readable  form.    Suffice 


Benjamin  Courtright  is  a  Research  Associate  in  the  Department  of 
Operations  Research  of  the  Johns  Hopkins  University,  Baltimore, 
Maryland. 

18 


19 

it  to  say  that  Johns  Hopkins  arrived  at  this  evaluation  quite  independ- 
ently, in  1963,  after  a  careful  consideration  of  system  problems. 
The  orientation  was  definitely  not  that  of  electronic  data  processing 
(EDP),  in  the  first  place.    In  order  to  apply  modern  control  techniques 
and  optimization  procedures  to  the  problems  of  library  organization 
and  operation,  we  needed,  in  particular,  data  on  the  library's  primary 
resource,  its  holdings.    The  shelf  list  is  monumental,  a  single,  unique 
file,  ninety  years  in  the  making.    For  micro-examination,  for  looking 
at  one  or  a  hundred  items  in  the  collection,  it  was,  and  is,  quite  ser- 
viceable; but  as  a  basis  for  control  measures  for  the  collection  as  a 
whole,  for  coping  with  the  new  generation  of  library  problems,  it  was 
simply  unusable.    No  one  has  ever  regarded  it  as  indigestible,  only 
because  no  one  has  ever  conceived  of  having  to  swallow  it.    Making 
this  file  assimilable  became  an  integral,  essential  part  of  the  so- 
lution to  problems  of  inventory  and  flow  in  the  library. 

This  account  will  omit  a  detailed  description  of  the  specific  li- 
brary problems  which  motivated  the  shelf  list  conversion,  and  the 
proposed  solutions  to  them,  as  well  as  a  consideration  of  the  various 
means  available  to  accomplish  the  conversion,  and  the  factors  affect- 
ing the  choice  to  be  made  among  them.    Both  topics  have  been  covered 
in  published  progress  reports.    The  following  then  is  a  case  report 
of  actual  operating  experience  with  the  processing  involved. 

The  conversion  process  decided  upon  was  accomplished  by  the 
following  steps: 

(1)  Producing  a  working  copy  of  the  shelf  list  by  microfilming. 

(2)  Transcribing  the  desired  elements  of  information  for  each 
record  in  the  file  and,  at  the  same  time,  producing  optically  scannable 
copy,  by  typing  the  elements  desired  on  a  typewriter  equipped  with 
the  special  font  required  for  the  scanning  process. 

(3)  Automatically  converting  to  magnetic  tape  on  an  optical 
scanning  (character  recognition)  device. 

(4)  Editing,  reformatting,  and  updating  the  cumulated  record  on 
magnetic  tape. 


Microfilming— The  need  for  a  working  copy  of  the  shelf  list  is  abund- 
antly clear  to  everyone  who  has  had  contact  with  library  procedures. 
The  conversion  process,  including  set-up  time,  took  two  to  three 
months;  to  have  attempted  this  from  the  shelf  list  itself,  even  if  it  had 
been  possible  to  do  it  in  the  immediate  vicinity  of  this  unique  file, 
would  have  produced  endless  confusion  and  frustration  for  everyone 
concerned.    The  choice  of  microfilming  for  producing  the  working 
copy  has  been  a  wholly  satisfactory  one.    The  cost  of  making  the 
microfilm  copy  has  turned  out  to  be  much  less  than  the  cost  for  any 
other  available  method.    The  completed  microfilm  copy  is  very  easily 
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handled  and  stored;  the  microfilm  replica  of  the  entire  shelf  list  fits 
into  a  box  about  1-1/2  feet  on  each  side.    Above  all,  the  microfilm 
copy  did  the  job  desired— it  served  as  an  adequate  working  basis  for 
the  conversion  process.    Although  they  had  had  no  prior  experience 
with  the  use  of  this  medium  for  any  job  of  this  nature,  we  were  de- 
lighted to  find  that  the  typists  worked  about  as  effectively  from  the 
microfilm  reader  as  they  do  from  original  copy. 

Microfilm  does  have  one  signal  disadvantage:    it  is  impossible 
to  edit  it  or  mark  it  for  any  intended  use.    Acceptance  of  this  limi- 
tation forced  the  staff  into  the  lengthy  and  laborious  process  of  con- 
structing a  very  careful  and  detailed  set  of  decision  rules  for  the 
typist  to  follow  in  reading  and  abstracting  the  data.    Instead  of  depend- 
ing on  the  editing  of  an  experienced  librarian  or  subject  matter  spe- 
cialist the  staff  was  forced  into  an  explicit  codification  of  the  treat- 
ment for  each  of  the  many  varieties  of  form  encountered.    The  end 
product  of  such  a  process  cannot  be  as  good  as  that  achieved  through 
pre-editing  by  an  experienced  and  competent  person.    On  the  other 
hand,  the  saving  in  time  and  in  cost  is  very  considerable;  the  process 
of  forming  explicit  guidelines  is  in  many  ways  a  salutary  one;  and  we 
believe  that  through  the  exercise  of  planned  safeguards  and  checks, 
the  final  product  will  stand  comparison  very  well. 

One  such  safeguard,  for  example,  was  a  direction  to  the  typist 
to  enter  a  special  symbol  "XXX"  in  place  of  the  record  element,  when- 
ever she  encountered  a  form  of  entry  which  seemed  to  her  not  to  be 
covered  adequately  by  the  rules.    The  last  step  in  the  conversion 
process  has  as  one  of  its  parts  the  printing  of  a  list  of  records  con- 
taining these  symbols,  by  a  simple  computer  program,  for  checking 
by  a  trained  technical  person.    This  means  that  such  attention  must 
be  given  only  to  the  2  or  3  percent  of  the  records  containing  the 
anomalies,  instead  of  to  every  record  in  the  file,  and  was  the  last 
step  in  the  conversion  process. 

The  microfilming  was  done  on  a  Recordak  Model  RM  automatic 
microfilmer.    This  machine  has  an  automatic  feed,  with  a  rated  ca- 
pacity of  375  cards  per  minute.    The  effective  operating  rate  was 
about  200  cards  per  minute,  allowing  for  the  necessary  shifting  of 
cards  and  personnel  breaks,  and  the  entire  operation  was  performed 
over  one  weekend,  when  the  Technical  Services  Division  was,  of 
course,  closed.    The  costs  comprised  $480  for  the  rental  of  the 
microfilmer;  $229  for  twenty-four  200 -foot  rolls  of  16  mm.  micro- 
film, including  processing;  and  $253  for  the  wages  of  the  personnel 
employed  for  the  purpose.    A  major  part  of  the  latter  cost  was  for 
some  sorting  and  re-sorting  that  had  to  be  done  on  the  file  in  the 
process.    The  total  cost,  then,  was  $962. 


Transcribing— Whether  to  convert  all  of  the  information  on  the  shelf- 
list  card,  or  only  some  part  of  it,  was  perhaps  the  hardest  decision 
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of  all.    The  question  was  considered  from  every  point  of  view,  over  a 
range  of  possible  alternatives.    The  final  decision  was  made  on  the 
basis  of  the  marginal  utility  of  the  various  elements  of  information 
and  on  the  basis  of  the  specific  functions  for  which  the  machine- 
readable  record  was  proposed.    A  detailed  consideration  of  these 
functions  is  set  forth  in  the  April  1963  Progress  Report  of  the  pro- 
ject.   The  fields  of  information  entered  into  the  record  are  as  follows: 

(1)  Call  number.    The  complete  call  number  is  taken  in  every 
case.    This  includes  the  date  of  publication  for  modern  cataloging. 

(2)  Main  entry.    For  corporate  entries,  the  entire  entry  is 
transcribed.    For  personal  authors,  the  last  name  and  initials  are 
taken.    Authors'  dates  are  omitted. 

(3)  Title.    This  is  the  full  title  in  most  cases,  a  shortened  form 
in  the  case  of  very  long  titles  and  certain  other  cases.    Subtitles  are 
omitted. 

(4)  The  number  of  pages,  or  the  number  of  volumes  in  the  case 
of  a  multi-volume  set.    (The  number  of  pages  for  each  volume  is  not 
given  in  the  latter  case,  in  local  cataloging.) 

(5)  The  physical  size  of  the  book.    This  is,  as  usually  given,  the 
height  in  centimeters  for  normally  shaped  books,  or  both  the  height 
and  the  width  where  given  under  Library  of  Congress  (LC)  cataloging 
rules  for  unusually  shaped  books. 

(6)  The  number  of  copies  of  the  work  in  the  Library's  collection. 

These  basic  elements  of  information  have  been  transcribed  for 
every  item  in  the  Library,  with  appropriate  coding  so  that  any  given 
field  of  the  information  can  be  retrieved  at  will.    The  records  were 
transcribed,  as  indicated  above,  from  microfilm  readers  onto  type- 
writers equipped  with  the  special  font  used  for  the  optical  character 
recognition  process. 


Optical  Scanning— In  a  survey  conducted  to  plan  for  the  optical  scan- 
ning phase  of  the  process,  two  organizations  in  the  country  having 
fully  operational  pieces  of  equipment  with  sufficient  sophistication  to 
do  the  job  were  found.    The  company  chosen  was  the  Control  Data 
Corporation  (CDC),  whose  wholly-owned  subsidiary,  the  Rabinow 
Engineering  Corporation,  had  developed  pioneer  models  of  such  de- 
vices.   Having  had  no  experience  with  either  of  the  organizations  con- 
tacted, the  choice  of  CDC  was  made  on  quite  explicit  grounds;  the 
scanning  device  used  by  the  other  company  accepted  a  line  of  only 
seventy  characters,  in  contrast  to  a  line  of  100  characters  for  the 
Rabinow  machine;  the  operating  rates  of  the  other  company's 
machine  were  much  slower  and  the  formatting  less  flexible  than 
that  provided  by  Control  Data  Corporation's  operation.    The  loca- 
tion of  the  CDC  machine,  in  Rockville,  Maryland,  has  also  proved 
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a  considerable  benefit,  in  view  of  the  frequent  coordination  requir- 
ed in  the  planning  and  execution  of  the  work. 

This  machine  is  a  one-of-a-kind  device,  built  specifically  for 
in-house  developmental  testing  and  service  bureau  operation.    Two 
different  types  of  agreement  were  concluded  with  CDC,  both  of  which 
have  functioned  satisfactorily.    Under  the  first  of  these  CDC  took  the 
microfilm  copy  of  the  shelf  list  and  assumed  responsibility  for  the 
conversion  process.    They  subcontracted  the  typing  work  and  assumed 
responsibility  for  the  implementation  of  the  conversion  rules  referred 
to  above,  subject  of  course  to  continuous  coordination  with  Johns 
Hopkins  in  their  application.    Following  the  optical  scanning  process, 
they  reprogrammed  the  raw  output  of  the  scanner,  converting  it  into 
the  bit  code  appropriate  to  the  computers.    The  guaranteed  accuracy 
for  the  whole  operation,  specified  in  the  contract,  is  as  follows: 

(1)  For  the  call  number.    Since  this  serves  as  an  identifying 
code  for  the  machine  record,  as  well  as  for  the  many  functions  it  has 
in  its  own  right,  the  maximum  obtainable  accuracy  was  specified 
here.    The  terms  of  the  guarantee  are  expressed  as  "virtually  100 
per  cent  accuracy";  the  procedure  used  was  to  make  a  visual  check 
from  the  print-out  of  the  final  product  against  the  original  microfilm 
for  this  field,  in  addition  to  the  error  detection  routines  incorporated 
for  the  record  as  a  whole,  with  both  CDC  and  the  typing  subcontractor 
assuming  responsibility  for  the  checking. 

(2)  For  the  remainder  of  the  record.    The  guarantee  specifies 
that  the  number  of  errors  shall  not  exceed  2  per  100  records. 

These  guarantees  have  seemed  eminently  satisfactory,  probably 
exceeding  the  accuracy  of  most  original  copy.    We  are  not  as  yet  pre- 
pared to  make  a  statement  concerning  our  own  sampling  for  verifi- 
cation of  these  levels,  but  there  is  no  reason  to  believe,  on  the  basis 
of  experience,  that  CDC  would  fail  to  take  full  responsibility  in  the 
event  that  our  checking  shows  these  rates  not  to  be  maintained  for 
some  part  of  the  completed  record.    Their  responsibility  would  take 
the  form  either  of  cash  rebates  for  our  cost  in  instituting  accurate 
correction  procedures,  or  of  their  redoing  the  work  in  question. 

The  total  cost  for  this  processing  is  at  the  rate  of  4.8  cents  per 
record. 

Under  the  other  form  of  agreement,  the  typing  is  done  by  us, 
and  the  typescript  is  shipped  directly  to  the  scanner  for  automatic 
conversion.    The  output  in  this  case  is  the  raw  output  from  the  scan- 
ner written  on  magnetic  tape;  here  we  do  the  subsequent  conversion 
processes  and  reformatting  on  our  computer.    The  cost  for  this  part 
of  the  operation  is  at  the  rate  of  2  cents  per  line  scanned.    The  scan- 
ning device  accommodates  100  characters  per  line;  our  records  ex- 
ceed this  length  in  8  to  10  percent  of  the  total  cases. 
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The  second  form  of  agreement  mentioned  above  was  instituted 
in  order  to  provide  a  solution  to  a  specific  operating  problem,  but  the 
procedure  has  become  interesting  in  its  own  right.    The  problem  re- 
ferred to  is  the  conversion  of  items  from  a  separate  serials  catalog. 
In  the  preliminary  survey  and  planning,  because  the  items  in  the 
serials  catalog  were  sufficiently  more  complex  in  their  bibliographic 
record  than  the  monographs,  it  was  preferable  not  to  follow  the  first 
procedure  outlined  above  in  this  case.     After  much  exploring,  a  very 
intelligent  and  competent  person  with  library  experience  was  hired, 
to  work  solely  on  the  serial  records  for  a  period  of  time.    A  suitable 
typewriter  was  obtained,  an  IBM  Selectric,  fitted  with  a  typing  ele- 
ment having  the  special  scannable  font  of  characters,  and  set  up  in  a 
place  immediately  adjacent  to  the  serial  librarian's  office  for  ease 
of  reference.    This  was  the  first  time  that  such  a  procedure  had  been 
tried,  but  after  some  initial  difficulties  it  was  found  to  work  very 
well.    Moreover,  it  was  found  that  the  overall  cost,  per  record  con- 
verted, is  very  nearly  the  same  as  under  the  other  arrangement. 
Having  no  experience  for  a  guide,  it  was  expected  that  the  unit  cost 
might  be  significantly  higher  (although  the  total  cost  would  in  any 
case  be  small  compared  with  the  monographic  conversion).    The 
knowledge  of  this  as  an  additional  resource  at  about  the  same  cost 
level  seems  highly  propitious. 


Total  Cost— The  total  cost  for  the  conversion  of  all  records,  including 
the  microfilming,  payments  to  CDC  for  their  conversion  processes, 
clerical  wages,  the  purchase  of  magnetic  tapes  for  the  storage  of  the 
final  product,  an  allowance  for  wages  and  computer  time  for  the  edit- 
ing and  reformatting  processes,  plus  an  allowance  for  overhead  to 
the  University,  amounts  to  $18,170.    On  a  per  character  basis,  this 
is  at  a  rate  of  $.0038  per  word  of  5  characters,  or  $.00076  for  each 
character  converted. 


New  Circulation  Control  Procedure 


The  circulation  control  system  outlined  below  was  first  pro- 
posed in  a  Progress  Report  issued  in  June  1964.    Testing  and  de- 
velopment of  the  requisite  means  to  put  it  into  practice  continued  for 
several  months  after  that;  and  in  April  1965,  shortly  after  the  open- 
ing of  the  new  central  library  building  on  the  Johns  Hopkins  campus, 
the  new  system  was  put  into  operation  for  the  Library  as  a  whole. 

Books  brought  to  the  circulation  desk  to  be  charged  out  are  not 
opened.    They  are  placed  together  on  the  counter,  as  many  as  are 
brought  by  a  single  borrower,  and  oriented  so  that  the  book  labels 
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(bearing  the  call  numbers)  all  face  the  same  way.    The  borrower's 
card  is  set  on  a  small  stand  facing  in  the  same  direction  and  the 
books  and  card  together  are  then  recorded  on  microfilm  by  a  suitable 
camera  positioned  a  few  feet  away  (see  Figure  1).    Ten  to  twelve 
books  can  be  photographed  together  in  this  manner,  so  that,  with  few 
exceptions,  all  of  the  books  borrowed  by  one  person  are  recorded  at 
the  same  time.    The  charging  process  is  completed  by  affixing  a  re- 
movable pressure  sensitive  label,  imprinted  with  the  due  date,  on  the 
outside  back  cover  of  each  book  (see  Figure  2).    The  idea  of  the  labels 
was  originated  in  connection  with  the  system  in  order  to  obviate  alto- 
gether the  need  for  opening  books,  so  as  to  speed  up  the  charge -out 
process,  and  to  provide  a  more  easily  noted  reminder  of  the  due  date. 

At  convenient  intervals  the  film  is  developed  in  a  processing 
unit  and  then  placed  on  a  microfilm  reader  situated  alongside  a  key- 
punch machine.    The  reader  currently  in  use  here  projects  the  micro- 
film image  at  a  magnification  which  is  slightly  greater  than  the  re- 
duction ratio  of  the  camera  lens,  so  that  the  image  of  the  books  is 
somewhat  larger  than  life-size.    The  call  number  only  is  read  from 
the  book  label  and  key-punched,  together  with  the  borrower's  card 
number  by  the  operator  (see  Figure  3).     Experience  has  shown  that 
if  the  call  number  can  be  read  on  the  book  itself,  it  can  be  read  from 
the  projected  microfilm  image.    Since  this  number  has  had  to  be  legi- 
ble in  order  for  the  book  to  be  in  place  on  the  shelf,  it  is  generally 
read  without  difficulty;  in  the  occasional  case  of  a  torn  or  mutilated 
label,  however,  identification  can  also  be  made  from  author -title  in- 
formation appearing  on  the  cover  of  the  book,  the  call  number  then 
being  obtained  from  the  Library  catalog.    This  does  not  occur  fre- 
quently. 

The  discharging  of  returned  books  follows  the  same  flow  de- 
scribed above.    The  borrower  drops  books  off  at  the  circulation  desk 
in  the  usual  manner,  and  the  attendant  simply  photographs  them  with 
a  card  marked  "Discharge,"  in  place  of  a  borrower's  card  (see 
Figure  4).    The  key-punch  operator  converts  the  call  number  as 
above,  putting  a  one-character  symbol  denoting  "discharge"  in  the 
place  of  a  borrower's  number.    Renewals  are  handled  in  an  analogous 
manner. 


Capabilities  of  the  System— Given  the  charge  and  discharge  records 
on  punch  cards,  all  of  the  remaining  functions  of  circulation  control 
are  accomplished  by  a  computer  and  computer  program.    The  pro- 
gram was  originated  by  Dr.  Willis  Gore,  Associate  Professor  of 
Electrical  Engineering  of  this  University.    It  performs  the  common 
tasks  which  are  always  associated  with  circulation,  and  in  addition 
has  the  ability  to  do  a  number  of  things  which  seem  highly  desirable 
but  which  have  rarely  if  ever  been  feasible  in  a  large  library. 
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In  the  category  of  common  tasks,  the  computer,  guided  by  the 
program,  keeps  track  of  all  outstanding  charge -outs  and  deletes  the 
record  for  items  returned;  it  determines  when  an  item  is  overdue 
and  prints  out  notices  bearing  the  proper  address,  ready  for  mailing; 
it  calculates  fines,  where  appropriate,  for  items  returned  late;  and  it 
provides  a  readily  accessible  record  of  the  whereabouts  of  all  ma- 
terials in  circulation,  or  in  use  internally  within  the  Library. 

The  capabilities  of  the  system  which  are  new  in  some  sense  do 
not  form  a  well-defined  list,  but  they  include: 

(1)  The  ability  to  compile  circulation  statistics  for  each  item  in 
the  collection  and,  equally  important,  to  present  this  information  in 
concise,  usable  form,  for  any  selected  set  of  material. 

(2)  The  ability  to  characterize  circulation  reliably  in  terms  of 
various  categories  of  borrowers,  the  types  and  subject  areas  of  ma- 
terials borrowed,  periodic  changes  in  rates  of  circulation  and  long- 
term  trends,  or  any  combination  of  the  foregoing. 

(3)  The  ability  to  print  out  at  any  time  a  listing  of  items  charged 
out  by  individual  borrowers. 

None  of  the  capabilities  listed  above  is  new  in  the  sense  that  its 
attainment  is  theoretically  impossible  under  older  systems  of  circu- 
lation control,  but  in  general  the  work  involved  would  be  tedious  and 
time-consuming.    Under  a  manual  card  file  system  or  any  of  its  logi- 
cal derivatives,  the  file  can,  in  the  nature  of  the  case,  be  kept  in  only 
one  order.  Conventionally  this  is  an  arrangement  by  call  number. 
The  file  however  has  many  other  attributes,  or  dimensions,  of  po- 
tential interest:    borrower,  category  of  borrower,  type  of  material, 
main  entry,  time  and  date  of  borrowing,  etc.    To  retrieve  information 
for  any  of  these  other  dimensions,  the  librarian  must  either  make  a 
painstaking  search  through  the  whole  file,  employ  a  system  of  me- 
chanically tagging  or  reordering  throughout  the  file,  or  generate  and 
maintain  a  duplicate  file  for  each  different  approach. 

In  a  computer-based  system,  on  the  other  hand,  the  file  may  be 
regarded  for  practical  purposes  as  multi-dimensional;  roughly  speak- 
ing, using  it  in  any  dimension  does  not  affect  its  integrity  in  any 
other.    There  is,  of  course,  a  cost  associated  with  the  computer's 
operations,  but  the  cost  is  small  relative  to  the  value  which  will  be 
obtained  for  the  capabilities  listed  above  and,  with  the  exceedingly 
rapid  growth  in  the  technology  of  computers,  it  seems  likely  that  this 
will  be  true  for  other  functions  in  the  not  too  distant  future.    In  ad- 
dition, the  "real  time"  taken  by  the  computer  operation  is  negligible, 
in  marked  contrast  to  the  length  of  time  required  for  such  purposes 
under  manual  systems. 


Means  of  Implementation-Encouraged  by  the  Library  staff,  we  em- 
barked upon  the  conversion  of  all  circulation  control  processes  in  the 
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new  Johns  Hopkins  Library  building,  then  in  the  final  stages  of  con- 
struction, to  the  proposed  design.    Our  lack  of  trepidation  was  matched 
only  by  our  lack  of  knowledge  of  the  problems  to  be  encountered.    The 
microfilming  of  the  shelf  list  was  a  considerable  task  but  that  was  a 
straightforward  application  using  a  standard  piece  of  equipment,  quite 
a  different  affair  from  finding,  modifying,  and  fitting  together  pieces 
of  equipment  to  perform  new  functions.    We  have  since  learned  a 
small  amount  about  lenses  and  prisms,  microfilming  equipment, 
films  and  film  processing,  pressure -sensitive  label  materials,  etc., 
but  in  the  main  it  has  been  the  humanity  and  forebearance  of  our  as- 
sociates and  the  Library  staff  which  have  made  the  development  pos- 
sible.   Our  boldest  claim  is  that,  if  the  system  works  with  what  we 
have  put  together  here,  it  should  work  anywhere. 

The  equipment  installed  at  the  circulation  counter  is  in  a  some- 
what different  configuration  from  that  originally  planned.  The  original 
design  contemplated  a  camera  set  in  an  overhead  position,  shoot- 
ing down  on  the  books  and  borrower's  card,  with  the  books  placed 
spine  upward  in  the  usual  case.    The  architectural  design  for  the  cir- 
culation area  in  the  new  Library  building  had,  however,  already  been 
drawn  up  at  that  time,  and  it  turned  out  that  a  configuration  such  as 
that  proposed  would  be  artistically  and  architecturally  unacceptable. 
Accordingly,  the  photographic  recording  process  was  adapted  in  such 
a  way  that  the  books  and  the  borrower's  card  would  be  photographed 
on  the  horizontal  axis.    The  books,  therefore,  are  set  on  the  counter 
on  their  ends,  in  the  position  they  normally  occupy  on  a  shelf,  with  a 
suitable  movable  block  to  hold  them  in  this  position,  while  the  trans- 
action is  recorded. 

In  order  to  avoid  having  the  camera  on  top,  however,  thus  clut- 
tering up  the  new  circulation  counter,  it  was  placed  just  to  the  side 
and  below  the  counter  top  with  a  reflecting  surface  set  at  a  45-degree 
angle  to  direct  the  image  down  into  the  lens  (see  Figure  5).    At  first 
a  front  surfaced  mirror  was  used  for  this  purpose,  but  a  total- 
internal-reflection  prism  does  the  optical  job  at  least  as  well  and  is 
a  great  deal  easier  to  support  in  a  proper  position  and  to  keep  clean. 

In  operation  the  camera  is  completely  enclosed  in  a  housing 
constructed  for  the  purpose,  with  a  removable  side  to  allow  for  the 
insertion  and  removal  of  film.    The  housing  serves  three  purposes: 
it  keeps  the  camera  anchored  firmly  in  position,  since  the  housing  is 
bolted  securely  to  the  supporting  shelf  and  the  camera  is  fitted  to  its 
interior;  it  provides  physical  protection  to  the  camera;  and  it  is  lined 
with  soundproofing  material,  to  reduce  the  slight  noise  of  the  relays 
in  the  camera  during  operation.    The  camera  itself  is  an  automatic 
microfilm  unit  removed  from  its  normal  supports  and  slightly  adapted 
for  the  purpose.    It  is  actuated  by  a  toggle  switch  which  has  been  con- 
nected to  the  operating  relays  and  located  remotely,  convenient  to  the 
location  where  books  are  placed  for  charging.    The  toggle  switch  is 
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spring  loaded,  and  simply  pushing  it  down  takes  the  exposure  and 
automatically  advances  the  film  ready  for  the  next  transaction.    It  is 
an  exceedingly  reliable  piece  of  equipment;  in  addition  to  a  highly  de- 
veloped engineering  design,  it  has  a  comprehensive  alarm  system 
which  gives  a  signal  whenever  the  camera  is  not  in  condition  to  take 
the  exposure  properly— if  the  supply  reel  of  film  is  nearly  exhausted, 
if  the  film  is  not  properly  seated  in  its  channels  when  reloaded,  or  in 
almost  any  other  anomalous  situation  which  might  occur. 

The  film  and  pressure-sensitive  labels  now  in  use  are  still  be- 
ing explored.    The  materials  in  use  do  reasonably  well,  but  work  is 
still  very  much  in  progress  to  improve  their  efficiency  and  per- 
formance.   The  labels  are  secured  with  a  background  design  preprint- 
ed on  them;  the  due  date  is  imprinted  on  them  by  a  small  automatic 
label  imprinter  secured  for  the  purpose,  and  they  are  available  at  the 
circulation  counter  by  an  automatic  dispenser  which  feeds  out  a  new 
label  each  time  a  label  is  removed.    The  cost  is  less  than  a  tenth  of  a 
cent  per  label  at  the  present  time  and  should  be  reduced  with  further 
development. 

All  computer  operations  have  been  carried  out  on  an  IBM  1401. 
Although  there  are  two  IBM  7094' s  in  the  University's  main  comput- 
ing facility,  the  smaller  machine  has  seemed  preferable  for  this  work. 
It  is  located  right  in  the  new  Library  building,  so  that  access  has 
been  excellent;  and  it  is  more  suited  in  basic  design  to  the  type  of 
processing  being  done.    This  may  be  subject  to  drastic  change  soon; 
the  University  has  filed  a  "Letter  of  Intent"  for  an  IBM  360  Model  50, 
to  replace  the  IBM  1401.    The  decision  is  not  yet  firm,  but  the  pros- 
pect is  an  exciting  one;  if  it  is  found  that  the  circulation  control  oper- 
ation can  afford  an  input-output  console  at  the  circulation  desk,  and 
if  a  suitable  time -sharing  package  can  be  included  in  the  computer's 
operating  system,  a  true  on-line  circulation  system  would  seem  to  be 
a  possibility. 

The  IBM  1401  in  use  has  8000  positions  of  core  storage,  four 
729  magnetic  tape  drives  (no  disk  storage),  and  a  fairly  complete  ar- 
ray of  special  features  on  the  central  processing  unit.    The  8K  storage 
limitation  is  a  decided  nuisance,  but  so  far  it  has  been  possible  if 
somewhat  awkward  to  program  around  it.    The  daily  circulation  con- 
trol update  program,  for  example,  has  nine  overlays. 

Programming  has  been  done  entirely  in  the  Autocoder  language. 
In  addition  to  the  basic  circulation  control  package,  there  has  been 
the  task  of  assembling  data  for  and  issuing  library  user  cards;  prior 
to  this,  library  cards  were  not  issued  at  the  University  at  all.    Identi- 
fication numbers  for  circulation  purposes  are  generated  in  the  com- 
puter, and  the  cards  are  prepared  on  preprinted  continuous  forms  on 
the  computer's  printer,  for  the  several  classes  of  users  of  the 
Library. 
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The  amount  of  time  required  for  making  indicated  modifications 
and  changes  in  the  programs  has  been  rather  surprising.    The  origi- 
nal circulation  control  updating  program  was  carefully  made,  and  yet, 
as  experience  has  been  gained,  the  need  for  adaptations  and  the  op- 
portunities to  make  improvements  have  been  virtually  continuous. 
By  far  the  largest  part  of  the  author's  time  for  the  past  several 
months  has  been  devoted  to  writing  programs,  debugging,  and  operat- 
ing the  computer.    On  the  basis  of  this  experience,  it  seems  that  the 
improvement  of  means  for  man- machine  communication  remains  a 
most  important  objective. 
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Figure  1 
A  Frame  of  the  Microfilm  Circulation  Control  Record 
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Figure  2 
Pressure-Sensitive  Label  with  Due  Date 
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Figure  3 
Microfilm  Reader  and  Card  Punch 
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Figure  4 
Returned  Items  in  Position  for  Discharge 
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Figure  5 
Close-up  of  Camera 


THE  USES  OF  PLATO: 
A  COMPUTER  CONTROLLED  TEACHING  SYSTEM* 


Donald  L.  Bitzer,  Elisabeth  R.  Lyman,  and  John  A.  Easley,  Jr. 


The  use  of  a  high-speed  digital  computer  as  a  central  control 
element  provides  great  flexibility  in  an  automatic  teaching  system. 
Using  a  computer-based  system  like  PLATO  permits  versatility  in 
teaching  logics,  since  changing  the  type  of  teacher  merely  requires 
changing  the  computer  program  but  not  the  hardware.    In  addition, 
having  access  to  the  decision -making  capacity  of  a  large  computer 
located  as  one  unit  permits  complicated  decisions  to  be  made  for 
each  student.    Such  capacity  would  be  prohibitively  expensive  to  pro- 
vide by  means  of  decision -making  equipment  located  at  each  student 
station.    Studies  of  queuing  that  occurs  with  multiple  student  requests 
show  that  the  system  could  teach  as  many  as  a  thousand  students 
simultaneously  without  incurring  a  noticeable  delay  in  processing  any 
student's  request. 

The  educational  results  thus  far  have  been  extremely  encourag- 
ing.   However,  reliable  conclusions  on  educational  achievement  must 
await  the  results  of  more  thorough  experiments  now  in  progress  which 
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include  larger  numbers  of  students  learning  under  a  variety  of  con- 
ditions.   The  adaptability  and  usability  of  the  system  for  a  variety  of 
purposes  in  education  (including  the  behavioral  and  physical  sciences) 
have  been  clearly  demonstrated. 


Introduction 


During  the  past  five  years,  the  Coordinated  Science  Laboratory 
at  the  University  of  Illinois  has  developed  and  experimented  with  an 
automatic  teaching  system  called  PLATO  in  order  to  explore  the 
possibilities  of  automation  in  individual  instruction.    The  PLATO 
system  utilizes  a  high-speed  digital  computer  as  the  central  control 
element  for  teaching  a  number  of  students  simultaneously,  while  still 
allowing  each  student  to  proceed  independently  through  the  lesson 
material. 

Three  successive  models  of  PLATO  have  evolved,  each  embody- 
ing improvements  indicated  by  the  previous  model.    The  first  con- 
sisted of  a  single  student  station  connected  to  ILLIAC,  a  medium- 
speed  computer  built  at  the  University  of  Illinois.  4    The  second  model 
had  two  student  stations,  was  connected  first  to  ILLIAC  and  then  to  a 
CDC  1604  computer,  and  was  used  to  study  the  problems  created  by 
multiple  student  use  of  the  system.  1®   The  third  and  current  model 
has  20  student  stations  connected  to  the  CDC  1604  computer. 

The  rules  governing  the  teaching  process  are  included  in  the 
program  read  into  the  central  computer.    A  complete  set  of  rules  is 
referred  to  as  a  "teaching  logic."    The  Coordinated  Science  Labora- 
tory has  experimented  with  two  basically  different  types  of  teaching 
logics,  a  "tutorial"  logic  and  an  "inquiry"  logic.    A  tutorial  logic  is 
designed  to  lead  the  student  through  a  fixed  sequence  of  topics,  but  it 
also  provides  branching  between  problems  (which  is  under  the  stu- 
dent's control,  voluntary  or  involuntary).    In  a  lesson  that  uses  the 
tutorial  teaching  logic,  the  system  first  presents  facts  and  examples 
and  then  asks  questions  covering  the  material  presented.    The  student 
composes  answers,  and  when  he  is  ready,  he  asks  the  system  for  a 
judgment.    If  he  finds  the  questions  too  difficult,  he  may  branch  to 
easier  material.    Involuntary  branching  occurs  when  evaluations  of 
the  student  performance  are  included  in  the  lesson  program,  which 
prescribes  branching  if  predetermined  criteria  are  met  by  the 
student. 

An  inquiry  teaching  logic,  on  the  other  hand,  can  be  character- 
ized as  a  system  permitting  dialogues  between  the  student  and  the 
computer.    Typically,  in  a  lesson  that  uses  an  inquiry  teaching  logic, 
general  problems  are  presented  to  the  student.    To  solve  them,  he 
must  request  and  organize  appropriate  information  from  the 
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computer.    In  such  a  teaching  logic,  the  student  may  be  asked  to 
demonstrate  his  achievement  by  answering  questions,  but  he  may  also 
seek  information  within  a  given  range  of  possibilities  in  order  to 
answer  such  questions. 

Both  types  of  teaching  logics  and  a  variety  of  lesson  materials 
have  been  employed  in  exploratory  studies  in  order  to  test  the  capa- 
bilities of  the  system.    Some  of  these  exploratory  studies  have  in- 
vestigated system  variables  such  as  data  rates  between  the  students 
and  the  system.  2   Other  studies  have  dealt  with  the  psychological  as- 
pects of  the  lessons  and  variations  in  the  teaching  logics.  1>"7,8 


The  PLATO  Teaching  System 


Student  Stations.    A  block  diagram  of  a  single  student  station  in 
the  PLATO  teaching  system  is  shown  in  Figure  1.    The  system  pro- 
vides for  communication  in  two  directions.    Each  student  is  provided 
with  both  an  electronic  keyset  as  a  means  of  communicating  with  the 
central  computer  and  a  television  screen  for  viewing  information  se- 
lected by  the  computer.    The  student's  main  keyset  resembles  a  type- 
writer keyboard,  and  the  keys  can  be  assigned  any  functions  the 
teacher  desires.    Usually,  the  alphanumeric  characters  are  assigned 
positions  similar  to  those  on  a  standard  typewriter  keyboard,  and 
punctuation,  special  characters,  or  special  control  functions  are  as- 
signed to  the  extra  keys. 

Electronic  Book.    There  are  two  sources  of  information  which 
are  usually  displayed  on  the  student's  television  screen.    These 
sources  (called  an  electronic  book  and  an  electronic  blackboard)  are 
diagramatically  shown  in  Figure  1.    The  electronic  book  consists  of 
a  bank  of  slides  prestored  in  an  electronic  slide  selector  which  is 
controlled  by  the  computer.    In  the  latest  model  of  PLATO,  the 
random-access  slide  selector  stores  122  slides  and  has  a  slide  ac- 
cess time  of  less  than  a  microsecond.    Information  stored  in  the  slide 
selector  is  the  type  that  would  usually  be  found  in  a  textbook  or  in 
class  notes.    Although  the  slide  selector  is  shared  by  all  of  the  stu- 
dents, the  students  can  view  the  same  or  different  slides  simultane- 
ously.   This  is  accomplished  by  having  the  video  information  available 
from  all  slides  concurrently,  and  by  connecting  electronically  the 
student's  television  display  to  the  proper  video  output. 

Electronic  Blackboard.    The  electronic  blackboard  consists  of 
a  computer-controlled  storage  tube  for  each  student  station.    Dia- 
grams, symbols,  and  words  are  plotted  in  a  point -by -point  fashion  on 
the  student's  storage  tube.    Approximately  40  alphanumeric  charac- 
ters can  be  written  on  the  student's  blackboard  per  second,  and  the 
entire  blackboard  can  be  erased  in  two  tenths  of  a  second.    This 
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arrangement  permits  information  (that  cannot  be  predetermined)  to 
be  presented  to  the  student,  such  as  information  generated  while 
teaching  the  student.    For  example,  the  system  can  display  a  sketch 
of  an  experiment  the  student  has  requested  or  an  answer  the  student 
has  composed  which  cannot  possibly  be  anticipated.    The  images  from 
the  blackboard  and  the  electronic  slide  selector  are  superimposed  on 
the  student's  television  display,  enabling  the  student  to  fill  in  blanks 
on  the  slide  and  compare  his  answer  with  the  question.    Figure  2 
shows  a  block  diagram  of  two  student  stations,  indicating  the  shared 
and  the  individual  parts  of  the  system.    Information  for  a  student  can 
appear  on  his  television  screen  from  either  the  blackboard  or  the 
book  or  from  both  simultaneously. 


Teaching  Logics  for  the  PLATO  System 


Original  Tutorial  Logic.    The  tutorial  logic  was  the  first  of  the 
two  main  types  of  teaching  logics  explored  on  the  PLATO  system.    In 
this  teaching  logic,  the  keys  were  divided  into  two  types— those  used 
for  inserting  constructed  responses  to  questions  and  those  used  by 
the  student  to  control  his  progress  through  the  lesson  material.    The 
lesson  material  was  organized  into  two  types  of  sequences:    the  main 
sequence,  consisting  of  the  minimum  material  that  must  be  used  by 
all  the  students,  and  the  help  sequences,  provided  for  students  who 
had  difficulty  with  questions  in  the  main  sequence. 

The  student  began  by  viewing  text  material  in  the  main  se- 
quence.   When  he  completed  reading  a  page  of  text,  he  proceeded  to 
the  next  page  by  pushing  the  button  labeled  "continue,"  or  he  returned 
to  a  preceding  page  by  pushing  the  button  labeled  "reverse."    As  the 
student  proceeded  through  the  lesson,  he  was  presented  with  ques- 
tions.   The  teaching  logic  required  that  all  the  questions  on  a  page  be 
answered  correctly  before  the  student  could  continue.    The  student 
was  allowed  as  many  attempts  as  necessary  to  answer  the  question 
correctly.    K  he  had  difficulty  with  a  question,  he  could  push  the  but- 
ton labeled  "help,"  which  took  him  into  a  help  sequence  pertaining  to 
the  question.    After  completing  a  help  sequence,  the  student  automati- 
cally returned  to  the  question  he  was  trying  to  answer  in  the  main 
sequence. 

In  a  later  version  of  the  original  tutorial  logic,  as  developed  by 
Braunfeld,  different  types  of  wrong  answers  called  for  different  help 
sequences.    An  error  detector  was  used  for  automatically  controlled 
branching.    In  addition,  the  later  version  of  the  original  logic  permit- 
ted some  questions  to  be  designated  for  monitoring  by  an  evaluator  in 
the  computer  program.    The  student's  responses  to  monitored  prob- 
lems were  used  to  determine  whether  he  was  branched  forward  to  the 
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next  section  of  the  main  sequence  or  routed  through  material  de- 
signed especially  for  students  who  failed  the  criterion  test  in  the 
evaluator. 

In  order  to  prepare  lesson  material  for  the  original  tutorial 
logic,  one  had  to  organize  the  material  into  a  set  of  slides  (with  at 
least  one  help  slide  for  each  question  in  the  main  sequence)  as  well 
as  prepare  a  parameter  tape.    The  parameter  tape  contained  the  an- 
swers to  the  questions,  their  location  on  the  slide  page,  and  the  order 
in  which  the  slides  were  logically  connected.    If  the  special  help  se- 
quences and  the  evaluator  were  used,  error  categories  had  to  be 
specified  for  the  error  detector  and  a  list  made  of  monitorized  prob- 
lems and  their  criteria  for  evaluation. 

The  most  recent  version  of  the  PLATO  tutorial  logic,  which  is 
much  more  generalized  than  its  predecessors,  will  be  described  in  a 
later  section  of  this  article. 

Inquiry  Logic.    While  the  tutorial  logic  serves  well  for  many 
purposes,  there  are  types  of  problems  in  which  even  more  control 
should  be  given  to  the  student  as  well  as  an  opportunity  to  ask  ques- 
tions of  the  computer.    To  accomplish  this,  the  inquiry  teaching 
logics  were  written. 

An  inquiry  teaching  logic  permits  a  student  to  request  infor- 
mation.   The  computer  correctly  interprets  the  request  and  replies 
from  stored  information  or  calculated  results.    This  logic  provides, 
in  effect,  a  syntax  for  the  student  to  use  in  communicating  with  the 
computer.    The  student  directs  his  learning  by  composing  his  own 
requests. 

In  the  tutorial  logics,  the  student  communicates  with  the  com- 
puter either  through  one  of  the  control  requests— turn  the  page,  judge 
my  answer,  give  me  help— or  he  composes  short  answers  which  usu- 
ally must  match  one  of  the  several  alternative  stored  responses.    If 
he  should  type  a  question  such  as  "What  does  'exponent'  mean?"  the 
computer  would  only  respond  with  a  "no"  since  it  treats  the  student's 
response  as  an  answer.    However,  the  inquiry  logics  provide  a  syntax 
by  which  a  student  can  ask  questions  about  the  lesson  he  is  studying. 
The  syntax  he  uses  can  be  viewed  as  a  tree  of  choice  points  in  which 
selections  are  made  at  each  choice  point. 

Figure  3  presents  a  simplified  flow  diagram  of  a  simulated  lab- 
oratory,  illustrating  the  general  form  of  classification  syntax.    By 
pushing  the  button  labeled  "lab,"  the  student  is  shown  the  general 
categories  of  available  information.    Having  chosen  one  of  these 
categories,  he  is  shown  more  detailed  selections  within  that  category. 
In  general,  successive  subcategories  can  be  chosen  until  the  detailed 
classification  is  specified.    However,  it  is  often  desirable  to  have  the 
major  categories  specified  independently,  e.g.,  object,  conditions  it 
is  exposed  to,  and  particular  properties  about  which  information  is 
desired.    In  such  a  case,  the  student  can  pass  repeatedly  through 
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several  successive  levels  of  selection,  once  through  for  each  general 
category.    Specifications  made  within  one  general  category  can  be 
stored  and  used  in  conjunction  with  those  made  within  another  cate- 
gory.   When  the  requested  information  has  been  completely  specified, 
it  is  displayed  on  the  student's  television  screen. 

Many  variations  on  this  classification  scheme  are  possible. 
Figure  4  shows  how  a  student  might  have  set  up  two  experiments  in  a 
simulated  laboratory  in  which  the  property  to  be  measured  is  chosen 
first  and  the  object  and  the  condition  specified  later.    The  properties 
about  which  information  can  be  obtained  are  the  weight  and  overflow 
volume  of  objects  listed.    The  conditions  available  are  the  liquids  in 
which  an  object  is  immersed.    Figure  4  also  illustrates  the  use  of 
both  graphical  and  numerical  display  of  results.    Figure  3  shows  only 
two  choices  at  each  choice  point,  and  Figure  4  shows  two  for  the  first 
choice  and  five  for  each  of  the  next  two  in  the  volume  experiment  (and 
six  and  seven  respectively  for  each  of  the  next  two  in  the  weight  ex- 
periment).   Figure  3  illustrates  only  four  specifications  on  each  pass 
through  the  tree,  but  two  passes  provide  16  combinations  of  objects 
and  properties.    Figure  4  provides  50  possible  specifications  on  one 
pass  through  the  volume  experiment  and  84  through  the  weight  experi- 
ment.   Some  combinations  have  been  used  involving  two  passes.    Ten 
choices  at  each  choice  point  on  three  levels  would  permit  a  thousand 
specifications  on  each  pass  through  the  tree.    PLATO  III  permits  128 
selections  at  each  of  128  choice  points,  which  should  be  more  than 
adequate  for  any  foreseeable  educational  purposes. 

Specification  at  choice  points  may  seem  a  somewhat  artificial 
way  of  asking  a  question,  but  it  resembles  the  way  one  locates  mer- 
chandise in  a  department  store,  and  even  elementary  school  children 
adapt  to  it  easily.    It  requires  only  a  slight  rearrangement  of  ordinary 
language.    For  example,  instead  of  typing  "What's  the  effect  of  ad- 
ministering nitroglycerine  on  the  heart  rate  of  the  patient?"  the  stu- 
dent in  a  PLATO  teaching  program  for  nurses  who  wished  to  ask  this 
question  typed  coded  numbers  for  the  following  sequence  of  phrases: 
return  patient  to  original  state,  give  drugs,  select  nitroglycerine, 
check  condition  of  patient,  vital  signs,  pulse  rate.    At  this  point  the 
computer  answered  with  the  pulse  rate.  6    Students  quickly  learned  the 
syntax  required  and  usually  formed  such  coded  questions  more  rapid- 
ly than  they  could  type  them  in  English.    The  computer  responded  im- 
mediately, displaying  information  obtained  by  computation  or  from 
memory.    The  student  proceeded  to  try  other  experiments  until  she 
was  confident  concerning  the  treatment  of  the  patient. 

An  inquiry  logic  written  for  the  PLATO  system  which  deserves 
special  comment  is  one  that  permits  the  student  to  solve  mathemati- 
cal problems  that  require  many  lines  of  work  and  for  which  all  pos- 
sible solutions  cannot  be  anticipated.  9    In  this  teaching  logic,  the 
student  is  informed  whenever  he  violates  any  of  the  rules  of 


40 

mathematical  logic.    The  computer  does  not  store  a  set  of  correct  so- 
lutions, but  it  does  store  the  mathematical  principles  available  to  the 
student.    The  rules  of  mathematical  logic  are  built  into  this  teaching 
logic  by  means  of  decision  programs.    Thus,  this  logic  simulates  a 
teacher  who  watches  students  at  work  and  tells  them  whenever  they 
make  an  error  but  doesn't  tell  them  what  they  should  have  written. 
The  student  is,  in  effect,  asking  whether  each  move  he  proposes  is  a 
valid  one,  a  question  to  which  he  gets  an  immediate  reply. 

The  PLATO  Compiler.    A  PLATO  compiler  was  developed  in 
1964  which  permits  simple  preparation  of  all  types  of  new  teaching 
logics.    With  this  compiler,  educational  researchers  have  prepared 
several  new  teaching  logics  suited  to  their  own  purposes  in  fields 
ranging  from  mathematics  to  the  behavioral  sciences. 

Preparing  an  inquiry  type  teaching  logic  requires  specifying 
the  tree  structure  of  the  syntax  the  student  uses  to  communicate  with 
the  computer.    Preparing  a  tutorial  logic  also  requires  specifying  the 
structure  which  the  student  or  teacher  uses  in  communication  de- 
cisions with  the  computer.    The  PLATO  compiler  permits  the  logic 
designer  to  specify  for  each  choice  the  next  choice  point  to  which 
each  response  leads.    Each  choice  point  can  present  a  slide,  some 
message  printed  on  the  blackboard,  operate  a  piece  of  auxiliary 
equipment,  etc.    All  of  these  details  are  specified  in  pseudo-English. 
Special  decision  rules  are  written  as  necessary  using  an  augmented 
Fortran  language. 

All  of  the  PLATO  programs  or  lessons  written  since  the  fall  of 
1964  have  been  written  for  the  compiler.    Many  of  the  old  lessons 
have  been  revised  and  reprogramed  using  the  compiler. 

New  PLATO  Tutorial  Logic.    The  new  PLATO  tutorial  logic, 
written  for  the  compiler,  allows  very  flexible  rules  for  the  teacher. 3 
The  teacher  may  allow  the  student  to  respond  with  long  answers. 
Several  help  sequences  are  permitted,  and  many  judgers  are  avail- 
able, including  a  spelling  judger  (which  prints  "SP"  instead  of  "NO" 
on  the  blackboard  when  a  spelling  mistake  is  made).    Eight  special 
effects  are  available  for  16  different  keys,  such  as  disallowing  certain 
keys  at  specific  times  in  the  lesson  or  introducing  an  inquiry  pro- 
cedure such  as  curve  plotting,  available  upon  student  request.    Spe- 
cial remedial  or  challenge  sequences  are  possible.    A  comment  page 
allows  a  student  to  make  comments  on  the  lesson  at  any  time;  an  in- 
structor page  allows  the  student  to  communicate  with  the  instructor 
via  the  PLATO  display.    Finally,  as  the  most  important  feature,  the 
new  logic  contains  an  author  mode  so  that  the  teacher  may  insert  or 
change  page  answers  and  page  descriptions  on  line  with  the  computer. 

Interconnection  of  Student  Stations.    Although  independence  of 
student  stations  was  initially  thought  desirable,  many  uses  of  station 
interconnections  were  later  suggested.    The  interconnection  was  ac- 
complished with  a  short  addition  to  the  resident  computer  program. 


41 

This  development  has  allowed  teacher -student  interactions,  negoti- 
ation studies,  and  concept  development  exercises. 

New  Logic  for  Problem  Solving.    The  more  generalized  version 
of  the  mathematical  problem- solving  logic  is  being  written  with  the 
use  of  the  compiler.    This  logic,  incorporating  improvements  indicat- 
ed through  experience  with  the  prototype,  will  allow  the  student  to 
formulate  his  own  problems  and  conjectures  and  work  them  out  with 
the  same  supervision  as  if  they  had  been  problems  stored  by  the 
author  of  the  lesson.    The  judgment  of  student  errors  could  be  post- 
poned, if  desired,  until  the  student  requests  that  his  work  be  marked. 
It  is  expected  that  this  logic  will  be  able  to  cope  with  problems  in 
elementary  algebra,  logic  and  set  theory,  and  some  portions  of  ge- 
ometry. 

Student  Records.    One  of  the  important  features  of  the  PLATO 
system  is  the  "perfect  workbook"  of  student  performance  kept  by  the 
computer.    The  student  records  include  a  record  of  each  button  pushed 
and  the  time  at  which  it  was  pushed.    This  information  is  available  in 
two  forms:    one  form  is  a  printed  history  of  events  that  can  be  im- 
mediately scanned  by  the  teacher;  another  form  is  one  stored  on  mag- 
netic tape  that  can  be  processed  by  the  computer  for  a  detailed  sta- 
tistical analysis. 


Exploratory  Studies  Using  the  PLATO  System 


Student  Performance  and  Queuing  Studies.    Several  studies, 
some  of  which  have  already  been  mentioned,  have  been  completed  us- 
ing both  the  tutorial  and  the  inquiry  teaching  logics.    Lesson  material 
drawn  from  mathematics,  computer  programing,  and  electrical  engi- 
neering initially  were  programed  with  the  original  tutorial  logic. 
Most  of  these  studies  employed  approximately  10  to  12  students  as 
subjects,  each  of  whom  attended  three  of  four  one-hour  sessions. 
Results  of  some  of  these  studies  are  available  in  another  report. 2 

Briefly,  the  results  from  the  early  investigations  showed  the 
following:    (a)  There  was  no  significant  difference  between  the  post- 
test  scores  of  students  who  received  instruction  via  the  PLATO  sys- 
tem and  those  who  attended  regular  class.    However,  the  amount  of 
time  spent  on  the  lesson  material  was  significantly  less  for  the  stu- 
dents working  on  PLATO;  and  (b)  using  over  50,000  student  requests 
obtained  with  the  lesson  material,  queuing  studies  were  performed. 
It  was  determined  that  a  general  purpose  computer,  having  a  high- 
speed capacity  of  1.5  million  bits,  would  allow  1,000  students  to  be 
tutored  concurrently  on  eight  different  lessons  without  incurring  a 
noticeable  delay  on  any  student's  request. 
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University  Courses.    Recently,  the  new  PLATO  tutorial  logic 
was  used  to  program  half  of  the  material  for  a  semester's  work  in  a 
course  in  circuit  analysis  offered  to  electrical  engineering  junior 
and  senior  students.    Although  no  detailed  analysis  or  evaluation  of 
the  students'  responses  was  made,  some  of  the  more  obvious  results 
showed  that  the  students  appreciated  flexibility  in  the  system,  enjoyed 
features  such  as  curve  plotting,  and  thought  the  course  material  was 
markedly  clarified  by  the  PLATO  lessons.    At  present,  the  logic  is 
being  used  for  credit  courses  in  "How  to  Use  the  Library"  and  "For- 
tran Programming  for  Business  and  Commerce  Students"  as  well  as 
for  the  electrical  engineering  course.    Evaluation  of  student  perform- 
ance will  be  made  from  the  detailed  records  provided  from  the 
system. 

Text  Testing.    Worthy  of  mention  is  a  study  now  in  progress 
which  uses  a  logic  basically  tutorial  in  nature  to  record  performance 
of  students  as  they  test  new  textbooks.    The  student  works  freely 
through  a  textbook,  which  is  reproduced  on  the  PLATO  system,  an- 
swering problems  or  questions  at  will.    The  on-line  author  input  al- 
lows on-the-spot  changes  and  revisions  by  the  author.    Data  retrieval 
programs  will  give  the  author  a  variety  of  information  useful  in  his 
next  revision. 

Studies  Using  Auxiliary  Equipment.    It  should  be  noted  that  the 
PLATO  system  can  include  auxiliary  devices  operated  under  com- 
puter control.    The  inquiry  training  lesson  used  a  computer -controlled 
motion  picture  projector.  5   Physiological  recording  devices  have  also 
been  used  with  the  system.  1   A  more  unusual  study  is  one  substituting 
a  piece  of  experimental  apparatus  for  a  student  at  a  station,  with  in- 
put from  the  experimental  setup  replacing  the  operator  response  at 
that  station.    A  student  at  a  second  station  can  manipulate  a  real  ex- 
periment through  his  station  without  ever  touching  the  apparatus  and 
can  obtain  the  experimental  results  on  his  display. 

Teaching  with  the  PLATO  system  can  be  extremely  varied  since 
laboratory  as  well  as  classroom  work  is  possible.    Experiments  may 
be  performed  which  are  either  real  (like  those  just  described)  or 
wholly  simulated  (like  those  referred  to  in  the  discussion  of  inquiry 
logics). 

Other  Research.    Other  teaching  research  projects  have  includ- 
ed drill  sequences  for  remedial  arithmetic  studies,  physiological 
studies  relating  to  mathematical  discovery,  1  and  work  in  the  area  of 
verbal  learning  and  retention.    The  wide  range  of  exploratory  studies 
possible  with  the  PLATO  system  serves  to  demonstrate  the  versatility 
and  flexibility  of  the  system.  H 
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EQUIPMENT   DIAGRAM 


(  Electronic  Book ) 


( Electronic  Blackboard ) 


Student 


Figure  1 
Block  Diagram  of  the  PLATO  Teaching  System  (One  Student) 


Figure  2 

Block  Diagram  of  the  PLATO  Teaching  System  Showing  Shared  and  Individual 

Parts  of  the  System 
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Store  Store 

i     i 


Store  Store    Ans.  Ans. 

i     i  i     i 


Ans.  Ans. 


Figure  3 

Simplified  Flow  Diagram  for  Simulated  Laboratory  Experiments 
Using  PLATO  in  Inquiry  Logic 
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Figure  4 
Example  of  Student's  Use  of  an  Inquiry  Teaching  Logic 
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APPROACHES  TO  LIBRARY  FILING  BY  COMPUTER 


Jean  M.  Perreault 


"The  essence  of  library  catalogue  is  arrangement  of  entries."! 

(a)  There  is  no  intention,  in  this  paper,  of  providing  anything  like  a 
new  code  of  filing  rules  for  use  with  the  computer.    Ted  Hines  and 
Jessica  Harris  have  made  a  valiant  and  largely  successful  try  at  this 
task.    It  is  recommended  that  you  obtain  it2  and  read  it  thoughtfully. 

(b)  Nor  will  this  paper  comment  on  the  two  classic  American  filing 
codes^  in  such  a  way  that  the  form  subdivision  for  a  subject-heading 
on  this  paper  would  read  "—Commentaries,"  but  rather  in  such  a 
way  as  to  give  it  "—Criticism,  interpretation,  etc."    (c)  Nor  (as  a 
final  disclaimer)  is  this  paper  thematically  concerned  with  the  hope 
for  code  revision— though  these  pages  come  closer  to  such  a  treat- 
ment than  to  (a)  or  (b). 

Instead,  there  will  be  an  attempt  (d)  to  present  some  of  the 
intellectual  or  bibliographical  problems  involved  in  the  notions  of  sort- 
ing and  filing,  and  then  (e)  an  outline  of  some  of  the  tools  and  tech- 
niques which  can  be  brought  to  bear  upon  their  resolution.    Together, 
these  ideas  should  (f)  make  possible  a  rational  basis  for  the  evalu- 
ation of  filing- code -revision  suggestions. 

The  author  is  not  entirely  neutral  in  all  this,  as  those  who  have 
read  my  paper  "The  Computer  and  Catalog  Filing  Rules" 4  should 
know;  but  the  following  discourse  will  subject  that  paper  to  consider- 
able retractatio.    Primary  influences  in  moving  toward  this  revalu- 
ation were  Lubetzky  and  Ranganathan.    (To  avoid  confusion  from  the 
outset,  the  author  does  not  take  "library  filing"  to  include  the  ar- 
rangement of  documents,  but  only  of  their  surrogates.) 

Words— as  anyone  who  has  read,  say,  Finnegan's  Wake,  knows 
—  can  be  a  lot  of  fun.    But  they  can  also  be  quite  a  burden,  particular- 
ly to  whoever  has  the  task  of  laying  them  end  to  end,  or  worse,  one 
atop  the  other,  in  what  could  be  called  a  vertical  rather  than  a  hori- 
zontal order.    All  discourse  is  the  construction  of  "horizontal" 
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strings  of  words;  the  problem  of  filing  is  the  construction  of  "verti- 
cal" stacks  of  words  in  discourse. 

This  last  phrase  should  be  emphasized  because  the  filing  prob- 
lem, that  of  organizing  stacks  of  discursions,  is  far  more  complex 
than  that  of  simple  sorting.    The  ideal,  the  situation  which  could  en- 
able fully  adequate  filing  by  ritualizations  such  as  computers  or  filing 
clerks,  would  be  the  reduction  of  filing  rules  to  sorting  rules. 

Filing  and  sorting  could  be  distinguished  thus:    to  sort  is  to  set 
in  order,  to  file  is  to  interpolate  into  an  already  existing  order  of 
sorted  items.    But,  common  and  easy  to  understand  as  this  may  be, 
it  is  far  too  superficial;  even  in  the  sorting  process  as  defined  here, 
there  must  necessarily  arise  interpolation— unless  the  items  being 
"sorted"  are  somehow  already  in  order.    And,  then,  what  does  "in 
order"  mean?    We  shall  try  to  see  later. 

To  sort  is  to  put  like  with  like.    The  term  was  used  from  the 
beginning  of  printing  shop  practice  to  mean  the  replacing  of  the  ele- 
ments of  the  broken  up  form,  each  element  of  which  belongs  with  its 
own  sort  (type)  in  the  case.    "A"  is  a  sort,  "a"  is  another,  and  all  the 
exemplars  of  each  are  put  back  into  their  appropriate  sector(s)  of 
the  upper-  or  the  lower-case,  and  this  putting  of  sort  with  sort  is 
sorting. 

Words  too  can  be  sorted,  just  as  can  complex  letter-forms. 
There  is  an  interesting  historical  example  of  this  cited  in  Douglas 
McMurtrie's  history  of  printing  The  Book,  5  where  it  is  pointed  out 
that  the  Chinese  idea  of  moveable  type  (for  printing  in  the  Uigur 
language)  was  for  the  elementary  pieces  each  to  print  a  word,  although 
Uigur  is  a  language  where  words  were  built  up  from  more  elementary 
particles,  namely  letters.    In  our  literal  Western  languages  sorting 
is  basically  the  ordering  of  the  smallest  elements,  namely  letters; 
but  it  is  intriguing  to  try  to  imagine  how  an  order  of  irreducible 
words  could  be  set  up.    Let  us  not  forget,  however,  that  our  own  alpha- 
betical (letter)  order  is  every  bit  as  arbitrary,  although  admittedly 
shorter  than  would  be  the  (word)  order  of,  say,  the  Mandarin  vocabu- 
lary.   Like  the  Chinese,  a  Western  printer  might  well  consider  letter- 
complexes  such  as  "a,"  "a",  and  "a"  to  constitute  three  additional 
sorts  besides  plain  "a,"  but  the  breakdown  into  simple  elements  could 
go  further,  to  "a,"  a/",  u^",  and  u/^",  so  that  these  accentual  elements 
could  be  freely  combined  with  other  ordinary  letters  (as  for  instance 
"' "  over  "n"  in  Polish).    By  this  technique,  the  total  number  of  pos- 
sible sorts  is  reduced,  and  this  is  only  the  logical  extension  of  the 
reduction  of  the  sorting  of  words  themselves  by  means  of  the  sorting 
of  their  elements,  letters. 

By  "words",  here,  is  meant  each  word— all  the  words  of  the 
dictionary  taken  individually.    Discourse,  where  words  are  not  taken 
individually,  is  quite  another  matter,  because  discourse  cannot  be 
standardized.    It  is  this,  in  fact,  that  makes  language  fun,  and  it  was 
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Joyce's  refusal  to  accept  the  standardization  of  words,  too,  which 
makes  Finnegan's  Wake  fun. 

We  can  establish  "sort-boxes"  for  everything  from  the  elemen- 
tary parts  of  an  alphabet,  or  of  all  alphabets,  up  to  the  elementary 
parts  (words)  of  a  language.    But  how  can  the  cross-over  from  this 
world  of  sorting  to  the  quite  different  one  of  filing  be  accomplished  ? 

First  of  all,  since  it  has  been  postulated  that  it  is  non- 
standardizability  that  sets  off  the  discursive  from  the  sortably  stand- 
ardizable,  we  must  try  to  see  what  it  is,  as  far  as  order  is  concerned, 
that  constitutes  discourse.    In  answer  to  this  I  would  suggest  articu- 
lation, the  same  sort  of  phenomenon  that  lets  a  plurality  of  lines  form 
various  2-  or  3-dimensional  figures  by  coming  together  at  a  variety 
of  angles.    Because  of  this  variability,  any  two  words  can  form  as 
many  patterns  of  discursive  meaning  as  the  language  will  allow— as  is 
demonstrated,  for  instance,  in  the  farce  line  "But  you  don't  even  know 
I'm  alive"  or  in  a  great  many  puns.  6 

The  usual  way  of  handling  these  points  of  articulation  between 
words  is  commonly  expressed,  among  librarians,  as  "nothing  before 
something,"  since  the  juncture  between  words  is  most  commonly 
shown  by  a  blank,  a  "nothing."    This  is  true  even  in  spoken  language, 
where,  although  the  blanks  between  words  are  not  always  clearly  per- 
ceptible, at  least  those  representing  phrase -points  normally  are. 
These,  indeed,  are  the  historical  antecedents  in  the  development  of 
our  present-day  rules  of  written- language  punctuation.^   The  reduc- 
tion of  the  principle  "nothing  before  something"  to  a  purely  sorting 
process  is  accomplished  by  treating  the  articulating  blank  as  a  part 
of  the  collating  sequence,  so  that  the  sequence  reads  "blank,  a,  b, 

n 

But  there  are  other  kinds  of  articulation  in  "horizontal"  dis- 
course than  that  constituted  by  the  blanks  between  words.    There  are, 
as  mentioned,  in  spoken  discourse:    intonation,  tempo,  accent,  etc.; 
in  written  discourse:    punctuation,  paragraphing,  abnormal  capital- 
ization, italicization,  etc.    Although  it  is  clearly  possible  to  take  these 
into  account  in  our  filing  codes— by  the  device  explained  in  "The  Com- 
puter and  Catalog  Filing  Rules''^  for  instance— which  device  leads  to 
phrase -by-phrase  filing  in  addition  to  the  word-by-word  filing  en- 
abled by  the  use  of  the  blank  in  the  collating  sequence— the  real  diffi- 
culty is  the  same  as  that  encountered  in  mechanical  translation  and 
in  classification  for  information  retrieval  with  machines  as  the 
clerks.    This  central  problem  is  ambiguity.    It  does  not  require  much 
reflection  to  see  that  if  each  word  in  a  language  could  denote  one 
concept  and  one  only,  any  word  or  words  in  that  language  could  be 
easily  enough  translated  into  another  similarly  characterized  lan- 
guage, provided  that  the  conceptual  underpinning  and  partitioning  of 
the  two  languages  was  the  same.    That  is,  if  there  is  no  word  in 
Eskimo  for  "to  lie",  then  the  English  verb  "to  lie"  cannot  be  adequate- 
ly translated  into  Eskimo  even  if  the  structural  similarity  is  there. 
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But  in  fact,  languages  are  made  up  of  words  which  are  not  univocal,8 
and  whose  equivocacy  becomes  even  more  marked  in  discursive  situ- 
ations, 9  since  there  the  context  sorts  out  the  one  sense  among  the 
several  as  appropriate.    Take  the  phrases  "a  man  of  iron",  "a  feature 
of  the  land",  "a  man  of  Florence",  "a  book  of  John's."    "Of"  in  these 
examples  means  successively  "with  the  characteristic,"  "inherent  in," 
"stemming  from,"  and  "belonging  to."    A  precisely  similar  equivocacy 
can  be  seen  in  our  marks  of  punctuation.    (This  is  not,  of  course,  to 
say  that  every  preposition  or  mark  of  punctuation  is  equally  burdened 
with  equivocacy,  but  that  some,  such  as  "of"  and  the  comma,  are 
made  to  do  a  far  greater  share  of  the  work  than  seems  fair.)  10 

Now  all  this,  in  terms  of  the  problem  of  filing,  is  incarnate  in 
our  desire  to  avoid  purely  mechanical  rules  and  in  our  tendency  to 
prefer  an  ordering  of  discursive  elements  in  the  same  way  that  we 
articulate  them  in  speech,  in  some  way  which  transcends  the  purely 
temporal  order  of  the  sounds  themselves.    This  tendency,  or  at  least 
something  faintly  caricatural  of  it,  is  to  be  seen  at  work  in  the  Ameri- 
can Library  Association  and  the  Library  of  Congress  filing  rules. 
And  lest  we  imagine  that  present-day  Americans  are  alone  in  this 
tendency,  bad  or  good,  see  the  Prussian  Instructions,  part  II.  H    In 
all  these  bodies  of  rules,  we  see  a  variety  of  solutions  to  the  fact  that 
in  spoken  discourse  words  can  be  variously  emphasized,  so  that  the 
usual  order  presumed  as  the  basis  of  all  filing  is  belied.    This  pre- 
sumption is  that  the  first  element  is  always  of  higher  importance  than 
the  second,  and  so  forth.  12 (This  point  is  given  the  name  "Canon  of 
Prepotence"  by  Ranganathan.   Unfortunately,  he  forgets  to  indicate 
that,  national  as  this  position  or  unconscious  policy  may  be,  it  is  not 
the  only  one.) 

This  presumption  should  be  examined,  if  only  briefly.    If  there 
are  two  Dewey  numbers,  say  309.99  and  310,  their  filing  order  is  de- 
termined by  positional  comparisons  which  accept  the  presumption 
mentioned  above.    The  same  is  true  as  well  for  such  a  pair  as  009 
and  100,  where  the  temptation  would  be  strong  for  a  naive  person  to 
put  '!'  earlier  than  '9'  and  not  worry  about  the  zeroes.    But  there  is 
a  different  kind  of  number,  one  encountered  even  more  frequently  in 
real  life:    the  integer  as  against  the  decimal  number.    In  this  case, 
two  numbers  are  compared  not  on  a  first-to-first-element  basis, 
and  so  forth,  but  in  the  exact  reverse;  here  30999  comes  after,  not 
before,  310. 

Now  although  rules  are  needed  for  sorting  of  numbers,  the  ver- 
bal sorting/ filing  problem  is  far  more  difficult,  or  at  least  forms 
much  the  largest  problem-sector  in  filing  practice,  except  in  the 
case  of  LC  call  numbers.    There  the  presence  of  both  decimal  num- 
bers and  integers  presents  problems  even  to  some  librarians.    And 
the  interesting  point  is  that  sorting  rules  for  words  almost  invariably 
operate  according  to  the  decimal  sorting  presumption.    No  one  would 
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suggest  that  "mad"  should  come  after  "management"  just  because 
*-ent"  is  lower  in  the  collating  sequence  than  "mad"  except,  of 
course,  in  the  organization  of  a  rhyming  dictionary.    Yet  there  are 
cases  where  something  close  to  the  integer -sorting  algorithm  is  at 
work  with  words,  for  instance  where  initial  words  such  as  articles 
are  assumed  to  be  of  no  value  in  the  sorting  order.    (This  tendency 
could,  alternatively,  be  compared  with  the  leading  zeroes  which  get 
"9"  in  009  to  file  ahead  of  "1"  in  100.    "The"  at  the  head  of  a  title  is 
regarded  as  analogous  to  such  ciphers.) 

If  sorting  is  the  linear  arrangement  of  simple  or  pseudo- simple 
inscriptions,  and  filing  is  the  linear  arrangement  of  semantic  and  dis- 
cursive inscriptions,  with  all  the  difficulties  implied,  and  if  the  goal 
is  to  make  the  problem  of  filing  simple  enough  to  be  performed  by 
clerks,  human  or  mechanical,  then  we  must  seek  for  means  to  re- 
duce these  semantic-discursive  aggregates  to  at  least  pseudo- simple 
ones.    There  are  several  tools  or  techniques  for  this  task,  ranging 
through  quite  a  spectrum  both  in  the  complications  of  input  required 
and  in  the  output  achieved. 

If  a  clerk  is  given  two  entries  to  file  relative  to  one  another, 
one  reading  "Bible.    English.    1964.    Goodspeed."  and  the  other  read- 
ing "Bible.    English.    Knox.,*  the  second  files  before  the  first  not  be- 
cause letters,  represented  here  by  the  "K"  in  "Knox"  comes  before 
the  "1"  in  "1964",  but  because  the  Knox  version,  date  unknown,  would 
come  among  the  first -order  group  of  English  (whole)  Bibles  with  un- 
specified date-facets.    (In  such  a  case  as  this,  a  perhaps  unexpected 
reflection  of  the  rules  for  inverted  file-order  as  derived  from  a 
citation -order  can  be  seen  quite  clearly;  see  the  Appendices  for  a 
fuller  discussion.) 

But  for  such  an  effect  to  be  produced  there  must  be  instructions 
to  the  filing  clerk  that  predict  such  possible  variations,  instructions 
such  as  "The  facet  after  'language'  should  be  'date';  if  not,  treat  it  as 
a  blank  date-facet,  filed  before  all  full  date-facets."    Essential  in  all 
this,  if  a  clerk  such  as  the  computer  is  employed,  is  that  each  facet 
of  such  an  entry  be  unambiguously  labelled,  so  that  the  problem  never 
occurs  of  the  computer-clerk's  not  being  able  to  "understand"  the 
elements  which  it  is  called  upon  to  manipulate. 

Two  instances  of  the  same  sort  of  discursive  articulation  can 
be  seen  in  comparison  (a)  of  such  a  pair  of  titles  as  The  Man  without 
Qualities  and  Man;  Zephyr  or  Clod.  Assuming  that  a  dictionary  catalog  is 
being  considered,  the  second  of  these  will  not  be  entered  at  all  unless 
the  explanatory  sub-title  is  included,  since  otherwise  it  will  be 
dropped  because  of  a  match  with  the  subject  heading  apparatus.    But 
if  they  are  both  entered  and  are  to  be  filed  relative  to  each  other,  the 
phrase-point  introduced  by  the  semi-colon  cannot  well  be  ignored  lest 
the  title  be  found  only  by  someone  who  could  imagine  a  title  beginning 
(as  if  "man"  were  an  adjective)  Man  Zephyr.  .  .  .    Thu*s,  although 
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"w"  in  "without"  precedes  "Z"  in  "Zephyr",  in  the  collating  sequence, 
the  phrase  "Man",  being  shorter  than  "The  Man  without  Qualities", 
should  probably  file  first— just  as  a  short  word  like  "man"  files  be- 
fore a  long  one  like  "management",  and  a  short  title  like  The  Man  be- 
fore a  long  one  like  A  Man  and  his  Dog.    It  can  also  be  seen  in  the 
order  of  (b)  such  a  pair  of  entries  as  "Mass— Meditations"  and  "Mass 
(Canon  law)."    The  parenthetical  qualifier  modifies  the  initial  word  in 
such  a  way  as  to  make  it  represent  a  new  concept,  a  longer  one  than 
the  same  word  in  its  unqualified  sense,  even  though  the  unqualified 
one  is  extended  by  a  subdivision— and  despite  the  priority  of  the  "C" 
in  "Canon"  to  the  "M"  in  "Meditations."    While  something  more  than 
pure  "alphabetic"  or  "symbolic" order  is  desired,  there  is  an  ineluc- 
table dependence  of  ^11  orders  upon  that  which  is  "matter"  to  their 
"form"— in  this  case  the  ";"  or  "— "  or  "(." 

Thus  there  is  a  definite  justification  for  the  traditional  attitude 
of  librarians  that  absolute  adherence  to  the  letters  (or  symbols)  alone 
of  an  entry  is  not  enough  to  generate  an  intelligible  and  useful  file. 
"Order"  is  that  which,  hopefully,  is  generated  by  a  file;  just  what 
state  of  affairs  it  is  that  can  be  characterized  by  "in  order"  is  not 
clear. 

Fugmann  defines  order  as  "The  meaningful  contiguity  (Beiein- 
andersein)  of  the  parts  of  a  whole.  "13    But  this  seems  to  imply  per- 
haps too  forcefully  the  naturalness  of  order;  an  organ  in  a  body  has 
such  a  part  to  play  in  the  whole,  but  a  word  in  a  list  of  words  is  not 
related  thus  to  the  others.    A  bit  more  neutrally  then,  one  might  de- 
fine order  as  "being  where  one  ought  to  be."    Thus  "2"  ought  to  be 
after  "1",  and  K3"  next;  but  the  reason  for  "b"  being  after  "a"  is  not 
quite  the  same.    The  order  of  letters  is  arbitrary,  not  as  with  num- 
bers, natural.    Yet  even  this  last  is  not  entirely  true,  since  the  num- 
ber sequence  could,  if  not  radix- 10,  be  quite  other  than  is  "naturally" 
expected.    Thus  it  can  be  seen  that  it  is  finally  the  concepts  referred 
to  or  represented  by  (1)  the  numbers  not  as  mere  digits,  but  as  real 
numbers  or  (2)  letters  not  as  mere  digits,  but  as  elements  of  words, 
which  are  the  basis  for  the  order  of  a  file— rather  than  that  of  a  sort- 
ing, which  depends  entirely  on  "form"  rather  than  "content."    Or  we 
could  say,  to  follow  the  earlier  terminology,  "matter"  rather  than 
"form." 

Even  if  there  seems  to  be  no  real  order  except  that  of  meaning, 
and  if  there  remains  a  mixture  of  arbitrary  and  part-natural  types, 
there  is  a  common  factor  at  work,  namely  that  they  both  make  pre- 
diction possible.    The  difference  between  sorting  and  filing  comes 
sharply  to  the  surface  when  the  predictability  to  be  found  in  each  is 
sought.    That  of  sorting  is  absolute,  within  the  allowable  limits  of  the 
decimal  or  integral  presumption.    But  that  of  filing  is  only  partial, 
(a)  because  of  its  dependence  upon  the  meaning  of  the  words  (seman- 
tics), and  (b)  because  of  the  meaning  of  the  discourse  constituted  by 
these  words  as  inter-related  (syntax). 
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The  essential  ideas  to  keep  in  mind  are: 

(la)  Filing  and  sorting  are  not  identical,  nor  do  they  refer  to 
different  stages  of  a  single  process.    Sorting  is  putting  like  with  like, 
whereas  filing  is  the  creation  of  a  series  of  complexes  by  means  of 
partial  likenesses. 

(Ib)  Discourse  is  what  is  to  be  filed,  in  libraries,  and  discourse 
is  made  possible  by  the  articulation  of  semantemes  (meaning-bearers) 
by  syntax. 

(Ic)  The  result  of  this  articulation,  as  well  as  of  the  character 
of  many  semantemes,  is  ambiguity.    Even  greater  ambiguity  charac- 
terizes a  complex  that  is  not  articulated:    thus  our  traditional  and 
quite  rational  desire  to  retain  as  much  as  possible  of  the  benefit  of 
syntactic  articulation. 

(2a)  There  are  two  presumptions  from  which  to  choose  in  build- 
ing a  file:    either  the  decimal  or  the  integral.    Numbers  can  be  filed 
either  way;  words  usually  can  be  filed  only  decimally; 

(2b)  Means  are  needed  to  convert  some  words  into  "leading 
zeroes,"  as  if  the  integral  presumption  were  operative. 

(3)  The  most  general  principles  for  filing  are  two:  (a)  "nothing 
before  something,"  which  can  also  be  elaborated  into  the  principle  of 
inversion  of  the  citation-order,  and  (b)  order  as  predictability. 


Some  Tools  and  Techniques  of  Order  Building 


There  can  be  two  polar -antithetical  positions  taken  with  regard 
to  the  order  of  a  file  of  complex  verbalisms:    the  purely  symbolic, 
and  the  purely  eidetic.    There  are  variations  within  each  such  pole, 
of  course,  so  that  within  the  adherents  of  eidetic  order,  there  are 
(a)  those  who  want  nothing  but  classification,  "Out  with  words  alto- 
gether!," (b)  those  who  want  chain -indexing  and  the  like,  and  (c)  those 
who  favor  a  strictly  subjective  or  semantic  interpretation  of  the 
words  and  punctuations,  especially  of  subject  headings.    Also,  within 
the  adherents  of  symbolic  order,  there  is  the  traditional  disagreement 
over  the  recognition  of  the  blank:    (a)  recognizing  it  yields  word-by- 
word order,  (b)  ignoring  it  yields  letter-by-letter  order.    Nor  are 
these  all  the  possible  shades  of  divergence. 

The  scope  of  the  present  inquiry  must  accordingly  be  restricted, 
lest  the  luxuriance  of  detail  make  the  presentation  of  the  polarity  un- 
intelligible; the  appendices,  however,  contain  additional  matter. 
Classification  as  an  alternative  to  the  subject  heading  catalog  will 
not  be  considered  even  though  I  favor  it  myself.    Nor  will  the  chain- 
index.    Nor  will  letter -by- letter  filing  be  admitted  as  an  acceptable 
device.    There  still  remains  a  polarity  between  the  symbolic  and  the 
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eidetic;  and,  as  stated  earlier,  what  should  be  sought  is  the  reduction 
of  filing  rules  (as  called  for  by  the  eidetic  nature  of  the  items  being 
manipulated)  to  sorting  rules  (as  called  for  by  the  symbolic  embodi- 
ment of  these  items). 

The  reason  for  this  reduction  is  the  need  for  univocity.    The 
computer  "knows"  nothing;  it  cannot,  as  Fairthorne  quips,  find  just 
the  important  words,  because  for  a  word  to  be  important  it  must 
mean  something.    The  computer  can  only  manipulate  words;  meaning 
means  nothing  to  it.    So  the  equivocacy  inherent  in  discourse  must  be 
avoided;  and,  if  this  can  be  accomplished,  the  computer  is  left  with 
things  to  manipulate  that  it  can  handle  (without  needing  to  understand 
their  meanings). 

The  two  polar  -antithetical  positions  then  refer  to  the  structure 
of  the  file  that  is  to  be  generated.    As  to  the  means  that  are  to  lead 
to  these  positions,  there  are  at  least  three,  from  which  each  position 
can  choose  at  least  two  as  its  means  of  embodiment.    The  three 
means  are  (a)  the  collating  sequence  as  such,  (b)  the  numerical- 
interpolation  device,  and  (c)  the  "programming"  of  the  cataloger. 

If  there  is  a  sample  set  of  complexes  to  file,  such  as  "WELLS 
(SOMERSET)"  [a  place  as  subject],  "Wells  (Somerset)"  [a  place  as 
author],  "Wells  of  despair"  [a  title],  "WELLS,  HERBERT  GEORGE" 
[a  person  as  subject],  "Wells,  Herbert  George"  [a  person  as  author], 
"WELLS"  [a  topical  subject],  and  "WELLS,  OIL"  [another  topical 
subject],  there  can  be  two  classes  of  order  set  up  amongst  them  in 
accordance  with  the  eidetic  or  the  symbolic  positions: 


SYMBOLICj 

Wells,  Herbert  George  [peA]  WELLS  [tS] 

WELLS,  HERBERT  GEORGE  [peS]  Wells,  Herbert  George  [peA] 

Wells  (Somerset)  [plA]  WELLS,  HERBERT  GEORGE  [peS] 

WELLS  (SOMERSET)  [plS]  Wells  of  despair  [T] 

WELLS  [tS]  WELLS,  OIL  [tS] 

WELLS,  OIL  [tS]  Wells  (Somerset)  [plA] 

Wells  of  despair  [T]  WELLS  (SOMERSET)  [plS] 

As  mentioned,  variations  are  possible  within  each  of  these 
polar  positions;  for  instance,  within  the  eidetic  position  all  authors, 
subjects,  and  titles  can  be  segregated  (the  divided  catalog  approach); 
or,  within  the  symbolic,  distinction  between  single-  and  double  -blanks 
can  be  observed,  giving: 

EIDETIC2  SYMBOLIC2 

Wells,  Herbert  George  [A,pe]  WELLS  [first  word  followed  by 

many  blanks] 

Wells  (Somerset)  [A,pl]  Wells,  Herbert  George 

WELLS,  HERBERT  GEORGE  WELLS,  HERBERT  GEORGE 

[S,pe] 
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WELLS  (SOMERSET)  [S,pl]  WELLS,  OIL  [first  word 

followed  by  double - 
blank] 

WELLS  [S,t]  Wells  (Somerset) 

WELLS,  OIL  [S,t]  WELLS  (SOMERSET) 

Wells  of  despair  [T]  Wells  of  despair  [first  word  fol- 

lowed by  single -blank] 

Now,  consider  the  means  for  achieving  these  various  results. 
The  collating  sequence,  (a)  may  have  too  much  expected  of  it  to  man- 
age the  eidetic  orders,  at  least  without  nominal  assistance  from  (c), 
the  "programming"  of  the  cataloger.    The  computer,  in  other  words, 
can  set  first  in  order  "Wells,  Herbert  George"  as  author  and  then  as 
subject,  because  there  is  a  difference  in  the  characters  of  which  each 
is  composed— just  as  the  filing  clerk  can  tell  apart  black  entries  for 
authors  and  red  entries  for  subject.    But  it  can  scarcely  be  expected 
to  put  places  ahead  of  topics  unless  it  has  available  a  ridiculously 
large  table  of  place  names  for  reference.    And  that,  in  any  case,  is 
dependence  on  the  cataloger  too,  as  is  depending  on  the  tracing- group 
number-families  to  distinguish  authors,  titles,  and  subjects. 

The  numerical-interpolation  device,  (b)  on  the  other  hand,  can 
achieve  any  desired  order  at  all,  but  while  it  may  be  considered 
reasonable  for  authors  and  subjects— which  can  be  counted  on  to  re- 
turn with  some  degree  of  frequency  15_ it  is  assuredly  not  so  with 
titles,  which  can  be  counted  on  to  be  almost  invariably  varied,  thus 
requiring  a  new  number  for  almost  every  document.    The  numerical- 
interpolation  device  seems  to  me  to  have  no  place  in  the  symbolic  po- 
sition, which  by  definition  should  not  call  upon  any  outside  assistance, 
at  least  not  beyond  that  which  the  cataloger  does  anyway,  such  as  as- 
signment of  tracing-group  number-families.    (It  does,  however,  have 
a  very  definite  place  in  several  clerical  operations  such  as  the  peri- 
odical check-in  file.)  16 

A  tiny  matrix  can  be  constructed  to  show  the  intersection  of 
these  two  sets  of  ideas  (positions  and  means),  with  checks  indicating 
mutual  appropriateness: 

EIDETIC 

.  SYMBOLIC 

COLLATING  SEQUENCE  ALONE 
NUMERICAL  INTERPOLATION 
'PROGRAMMED'  CATALOGERS 

The  "programming"  of  the  cataloger,  (c)  the  only  one  of  the 
three  devices  necessarily  involved  with  both  the  eidetic  and  the  sym- 
bolic positions,  enters  unrecognized  even  in  the  apparently  pure 
collating- sequence  technique  as  it  is  what  distinguishes  between 
otherwise  identical  entries.    But  it  has  a  larger  role  to  play  in 
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situations  of  a  higher  complexity  than  those  shown  in  the  sample, 
such  as  the  quasi-facet  structure  of  Bible  entries.    (For  greater  de- 
tail, see  the  Appendices  and  the  end  of  this  section.) 

It  is  well  to  detail  the  actual  operations  required  by  each  of 
these  three  techniques  before  attempting  to  proceed  to  an  evaluation 
of  results  and  a  recommendation  of  goals.    The  collating  sequence 
might,  if  assumed  to  encompass  a  sufficiently  wide  gamut  of  dis- 
criminations, achieve  even  fairly  good  results  in  the  eidetic  as  well 
as  the  symbolic  ordering  of  entries.    However,  it  has  been  my  ob- 
servation* that  the  letters  of  the  alphabet  can  be  called  self- 
justificatory  insofar  as  they  either  transcribe  the  given  elements  of 
the  document  or  embody  the  controlled  vocabulary  decisions  of  the 
cataloger.    This  distinction  between  the  a-scriptive  and  the  de- 
scriptive is  brought  out  more  fully  in  my  paper  "Documentary  Rele- 
vance and  Structural  Hierarchy. "  17   Our  cataloging  rules,  contrasted 
with  the  policies  of  analytical  bibliographers  who  treat  punctuation 
and  the  like  as  being  as  sacrosanct  as  the  words  of  the  transcription, 
take  an  unfortunately  cavalier  attitude  about  punctuation.    The  rules 
avoid  a  clear-cut  decision  between  two  attitudes,  one  of  which  would 
lead  to  leaving  the  punctuation  just  as  it  is  found  (descriptive),  the 
other  to  transforming  given  punctuation  into  functional  (controlled, 
ascriptive).    Thus,  what  the  computer  is  fed,  in  terms  of  guidance 
for  the  application  of  the  collating  sequence  (which,  other  than  the 
alphabet  and  the  numeral  digits,  should  not  be  considered  as  frozen 
forever  by  the  hardware  manufacturers),  is  far  from  the  kind  of  thing 
that  can  help  in  the  decision,  for  instance,  that  parenthetical  expres- 
sions like  "(Somerset)"  create  a  place  sub-group.    To  do  this,  the 
parenthesis  would  have  to  be  used  for  such  a  meaning  and  no  other. 
As  mentioned  before,  codes  of  descriptive  cataloging  do  not  bind  their 
practitioners  to  a  thoroughgoing  usage  of  any  particular  punctuation 
symbol  for  any  particular  function.    Nor,  indeed,  do  codes  for  the 
rendering  of  headings.    To  insist  on  this  consistency  of  usage  would 
be,  to  some  extent,  further  "programming"  of  the  cataloger. 

The  two  alternatives  in  the  use  of  the  collating  sequence  alone 
as  a  filing  device— and  note  that  such  a  device  does  indeed  reduce 
filing  to  sorting— are  (a)  the  superposition  of  the  collating  sequence 
onto  present  descriptive  and  entry  practice,  or  (b)  its  superposition 
onto  descriptive  and  entry  practice  thoroughly  re -worked  for  the  sake 
of  such  results.    The  operative  presumption  here  is  decimal,  so  much 
so  that  even  integral  numbers  would  have  to  be  fitted  out  with  leading 
zeroes  (for  instance,  in  dates  of  the  Christian  era  earlier  than  A.D. 
1000).    The  computer  can  then  operate  upon  these  atomized  elements 


*In  the  course  of  an  investigation  of  the  permissible  compres- 
sion of  book-titles  for  the  Bro-Dart  Foundation.    See  Appendix  IV. 
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(digits)  of  complex  verbalisms,  filing  them  by  sorting  them.    But  it 
must  be  noted  too  that  unexpected  difficulties  can  easily  arise  unless 
cataloging  usage  is  modified,  primarily  in  the  direction  of  a  far  more 
stringent  attitude  about  formats  and  the  like.* 

Use  of  the  standard  IBM  collating  sequence,  for  instance,  in 
which  the  punctuations  in  our  sample  (comma,  parentheses,  blanks) 
are  accepted  as  listed,  gives  (since  the  order  built  into  the  hardware 
is:    blank,  comma,  opening  parenthesis)  SYMBOLIC2-    If  policy  were 
to  dictate,  instead,  SYMBOLIC^,  it  would  be  necessary  to  treat  these 
various  punctuations  as  blanks  to  be  ignored  in  the  creation  of  the 
"sort-tags."    These  then  look  (with  "real"  blanks  indicated  as  "b") 
thus: 

WELLSbbbbb"" 

WellsbHerbertbGeorge 

WE  LLSbHE  RBERTbGEORGE 

Wellsbofbdespair 

WELLSbOIL 

WellsbSomerset 

WE  LLSbSOMERSET 

The  numerical-interpolation  device,  (b)  as  mentioned  is  hardly 
applicable  to  symbolic-order  filing,  since  it  assumes  decisions  about 
order  on  the  part  of  the  cataloger.    These  decisions,  once  made,  al- 
low the  computer  to  proceed  to  a  kind  of  second-order  symbolic-order 
sort.    In  the  earlier  example,  for  instance,  the  policy  decision  could 
be  (instead)  that  EIDETICj  was  the  desirable  order,  and  that  a  span 
of  numbers,  say  5000-6000,  be  allocated  to  all  entries  beginning 
"wells."    The  seven  entries  would  thus  generate  an  interval  of 
1000/7  =  142+,  which  can  be  rounded  out  to  140.    The  seven  entries 
are  accordingly  assigned  numbers  to  "fossilize"  their  policy- 
determined  order: 

5070      Wells,  Herbert  George  5490      WELLS  (SOMERSET) 

5210      WELLS,  HERBERT  5630      WELLS 

GEORGE 

5350      Wells  (Somerset)  5770      WELLS,  OIL 

5910      Wells  of  despair 

Later  additions  to  the  file  are  assigned  such  numbers  as  will 
intercalate  them  in  the  place  appropriate  to  the  policy  (for  instance, 
"WELLS  FAMILY,"  5210  +  70  =  5280).    These  numbers,  once  as- 
signed, are  either  (a)  automatically  added  to  each  entry  by  the  com- 
puter from  a  look- up  table  (a  program  arrangement  can  provide  for 

*Daniel  Gore's  character,  Melissa  Spindrift,  has  a  point  in  her 
fanatical  insistence  on  obedience  to  trivial  rules— at  least  in  terms 
of  the  needs  of  computers.    (See  Reference  18.) 
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signalling  the  cataloger  whenever  an  unassigned  entry  is  input),  or 
(b)  input  by  the  cataloger  on  the  basis  of  a  reference  table,  thus  sub- 
stituting for  the  verbal  entry-form  entirely. 

The  device  of  "programming"  the  cataloger,  (c),  as  mentioned, 
is  implied  in  every  code  of  cataloging;  but  the  needed  specific  con- 
cern for  the  enabling  of  filing- manipulation  is  present  only  accident- 
ally, if  at  all. 

Substantial  additions,  dictated  by  the  policy  desired  in  terms  of 
variations  within  the  eidetic  position,  would  be  required  if  this  de- 
vice were  to  be  successful  in  reducing  filing  to  sorting.    For  instance, 
in  the  example  of  the  two  Bible  entries,  what  is  necessary  to  place 
"Goodspeed"  before  "1964"  would  be  the  indication  of  facets:    "^Bible. 
[5,4]    ^English.    [2]   ^Goodspeed."  as  against  "6 Bible.    [5,4]    3Eng- 
lish.    ^1964.    [!].»    (The  blank  facets  "5"  and  "4»  stand  for  section 
and  book,  respectively.)    An  even  more  difficult  example  is  the  filing 
of  names  such  as  those  beginning  with  "John,"  which  can  be  followed 
by  sobriquets,  place-designators,  given  names,  titles  of  nobility  or 
sovereignty,  etc.    Two  lists  of  "John"  entries  are  given  in  the  Appen- 
dices.   Appendix  I  is  a  list  of  the  subject-heading  usage  and  ordering 
of  such  entries,  and  Appendix  II  is  a  list  of  main  entry  usage  and 
ordering  as  drawn  from  LC  printed  catalogs.    Analysis  of  these  lists 
shows  that  no  principle  is  employed  throughout  which  can  be  expected 
to  provide  the  predictability  necessary  to  locate  any  single  entry  or 
group  of  entries  without  scanning  the  whole  list.    Some  seemingly 
outlandish  placements  may  be  due  to  nothing  more  than  individual 
clerical  errors  rather  than  to  the  following  of  rules,  but  it  should  be 
remembered  that  the  more  complex  the  rules  (and  the  less  they  de- 
pend on  pure  ritual)  the  more  likely  are  cases  of  forgetfulness.    How- 
ever, in  the  filing  of  names  of  sovereigns,  for  any  desired  eidetic 
order  to  be  achieved,  it  is  necessary  not  only  to  indicate  empty  facets 
(as  with  the  Bible  entries),  but  to  label  each  facet  with  a  weighting 
factor  not  determined  by  its  position  in  the  entry  as  written,  in  ac- 
cordance with  such  a  citation  order  as  "Name.    Rank.    Area  of 
sovereignty.    Number,"  as  against  the  written  order  "Name.    Number 
[and  sobriquet].    Rank.    Area  of  sovereignty".    This  is  not  to  say  that 
such  relatively  simple-minded  means  are  capable  of  setting  right 
every  problem  of  filing  order.    Note,  for  instance,  the  resumption  of 
a  second  sub-alphabet  in  the  main  entry  list  at  "John  Alcober."    Sev- 
eral entries  that  follow  this  one,  although  treated  in  accordance  with 
a  logical  rule  similar  to  that  mentioned  in  the  earlier  example  of 
"Mass  (Canon  law)"— namely  that  a  longer  first  facet  follows  a  short- 
er, even  where  the  shorter  is  followed  by  a  subdivision-word  of 
higher  alphabetic  position— do  not  seem  legitimately  to  be  considered 
as  longer  facets.    They  do  not  since  the  usual  subdividing  facet  of  the 
first  sub-alphabet  is  a  sobriquet  or  a  place-designator,  such  as 
"  John  the  Fearless,  duke  of  Burgundy"  or  "  John  of  Cappadocia" . 
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The  lengthening  element  of  the  supposedly  longer  first  facet  is  simi- 
larly characterizable,  for  instance  "John  o' London,  pseud."  or  "John 
Chrysostom,  Saint"  (Chrysostom  is  not  a  name,  but  a  sobriquet  mean- 
ing "golden-  tongued.") 

Implicit  in  these  anomalies  are  the  data  that  can  lead  to  an  im- 
portant distinction  between  two  aspects  of  the  supreme  desideratum, 
predictability.    This  distinction  will  be  more  fully  discussed  shortly. 


An  Attempt  at  Evaluation 


Many  solutions  to  these  problems  have  been  proposed,  but  if 
they  do  not  recognize  and  control  the  actuality  of  the  difficulties  en- 
countered in  searching  and  in  cataloging,  they  can  be  called  "relative- 
ly simple-minded."    Nor  is  it  true  that  all  such  problems  are  neces- 
sarily real  ones.    A  real  problem  in  this  area  is  one  that  not  only 
presents  us  with  an  intricate  tangle  of  circumstances,  but  one  where- 
in a  real  need  is  prevented  from  being  met.    What  are  the  real  needs 
here?    The  most  important,  as  can  be  seen  from  some  of  the  fore- 
going, is  predictability.    To  need  to  find  a  particular  sovereign-person 
entry  but  to  be  unable  to  tell  where  to  find  it,  even  if  the  words  used 
in  it  are  known,  is  a  need  prevented  from  fulfillment.    The  same  is 
true  where  it  is  necessary  to  find  a  particular  subject  heading  made 
up  of  several  phrases,  even  if  one  knows  (or  is  told  syndetically)  the 
precise  style  even  of  the  punctuation  between  the  phrases;  or  where 
it  is  necessary  to  find  a  class  of  entries  that  form  a  group  (among 
others)  of  divisions  of  some  particular  subject-heading  or  corporate 
entry.    If  it  is  impossible  to  predict  where  such  entries  or  groups  of 
entries  are  located  in  the  file,  there  is  a  real  problem. 

Basically,  the  difficulty  is  with  the  lack  of  system  in  the  cata- 
loging policies  in  use  among  librarians.    As  a  somewhat  more  pro- 
tracted example,  a  discussion  of  the  subject-heading  "Art,  Byzantine" 
follows.    According  to  the  LC  authority-document  Subject  Headings,  19 
the  heading  is  among  the  fourth  group  of  divisions  of  the  heading 
"Art."    The  first  of  these  sub-alphabets  is  that  made  up  of  (a)  form- 
divisions  such  as  "Art— Congresses"  and  "Art— Philosophy,"  as  well 
as  references  from  unused  headings  such  as  "Art— Negroes  see  Negro 
art."    The  second  sub-alphabet  is  made  up  of  (b)  place  names.    If 
such  a  name  were  in  use  at  present,  we  might  expect  to  find  "Art- 
Byzantium"  here.    The  implication  is,  however,  that  this  group  is  to 
comprise  documents  on  the  artifacts  in  a  place,  rather  than  the  art- 
istry characteristic  of  a  place.    The  third  sub-alphabet  is  somewhat 
of  a  hash,  including  inverted -phrase  adjectival  (c)  styles-cum-periods 
like  "Art,  Abstract"  and  "Art,  Baroque,"  (d)  styles- cum -places  like 
"Art,  Cluniac"  and  "Art,  Oriental,"  along  with  (e)  miscellaneous 
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orientations  like  "Art,  Commercial  see  Commercial  art,"  "Art,  Im- 
moral," "Art,  Muncipal,"  and  "Art,  Regional."    (The  application  of 
facet-analysis  to  the  elements  of  this  sub-alphabet  reveals  the  full 
extent  to  which  lack  of  system  can  become  enshrined  in  the  hearts  of 
the  unwary.)   The  fourth  sub-alphabet,  to  which  belongs  "Art,  Byzan- 
tine," is  made  up  of  inverted -phrase  adjectival  (f)  ethnic -cum -place 
names  like  "Art,  African,"  "Art,  Celtic,"  and  "Art,  Greek"  (to  which 
an  xx-reference  from  "Classical  antiquities"  is  prescribed,  leaving 
the  user  somewhat  puzzled  if  he  has  in  hand  a  document  on  modern 
Greek  art).    Here  is  found  the  complement  to  the  second  sub-alphabet, 
(b):    such  headings  comprise  the  artistry  characteristic  of  Byzantium, 
rather  than  artifacts  to  be  found  in  that  city  or  empire.    The  fifth 
sub-alphabet  is  composed  of  inverted -phrase  adjectival  (g)  periods 
like  "Art,  Ancient,"  "Art,  Renaissance,"  and  "Art,  Modern— 20th 
century"  arranged  chronologically  in  defiance  of  their  verbal  form. 
The  sixth  sub-alphabet  comprises  uninverted  phrases  beginning  (h) 
"Art  and  .  .  .,"  "Art  in  .  .  .,"  and  the  like. 

Now  let  us  assume  a  person  in  need  of  documentary  information 
about  Byzantine  art,  a  person  to  whom  that  turn  of  phrase  appears 
natural.    The  syndetic  apparatus  of  LC  subject  headings  will  tell  such 
a  person  that  a  quasi -classification  under  "Art"  is  the  proper  place 
to  look  for  such  information,  rather  than  under  the  first-thought-of 
expression.    But  it  is  not  pointed  out  to  him,  except  implicitly,  where 
such  an  entry  can  be  predicted  to  be  among  all  those  beginning  "Art". 
What  is  explicitly  (but  still  not  thematically)  pointed  out  is  that  the 
form  of  the  heading-to-be-sought  is  "inverted -phrase  adjectival", 
and  that  the  adjective  is  formed  from  a  proper  name  (for  the  sake  of 
argument,  I  am  willing  to  grant  that  no  one  is  ignorant  of  the  general 
meaning  and  formation  of  "Byzantine.") 

Notice  that  this  user  cannot  assign  the  heading  "Art,  Byzantine" 
to  the  fourth  (ethnic-cum -place)  sub-alphabet  under  "Art"  because 
there  is  nothing  in  the  syndetic  link  to  tell  him  this.    More  funda- 
mentally he  cannot  because  even  though  he  might  be  aware  of  the 
groups  generally  to  be  expected  under  a  broad  heading  like  "Art"  and 
(c)  even  of  the  specific  order  of  the  ethnic-cum-place  group  among 
the  others,  there  remains  a  fundamental  difficulty.    He  may  very  well 
be  thinking  of  "Art,  Byzantine"  as  meaning  something  quite  different 
from  that  implied  by  its  official  place— as  a  style -cum -period  or 
style -cum -region  heading  (analogous  to  "Art,  Gothic")  in  the  third 
sub-alphabet,  or  as  a  period  as  such  in  the  fifth  (analogous  to  "Art, 
Renaissance").    This  difficulty  is  similar  to  that  noted  earlier  with 
regard  to  some  of  the  irregularities  of  order  under  "John":    if  a  per- 
son does  not  know  the  precise  significance  of  part  of  a  heading,  an 
eidetic  order  may  well  be  such  as  to  prevent  him  from  finding  it  un- 
less he  takes  the  time  and  trouble  to  look  through  all  the  headings 
beginning  "John"  or  "Art".    The  difficulty  arises  because  he  does  not 
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know  the  part  of  the  heading's  precise  significance  as  used  by  the 
cataloger  or  filer,  not  that  he  does  not  know  its  precise  form  (which 
the  syndetic  apparatus  reveals  to  him).    The  case  of  "Art"  is  analo- 
gous to  that  of  "John"  in  the  way  in  which  it  modifies  the  desideratum 
of  predictability. 

Notice  also  that  in  a  case  of  sorting  of  meaningless  and  "atomic" 
signs  (whether  digit-particles  like  accents,  digits,  syllables— as  in 
spoken  Japanese— or  even  words— as  in  Uigur)  the  order  is  entirely 
arbitrary  just  because  there  is  no  meaning  in  their  seriality.    (The 
numerals  "1"  through  "9"  do  not  fall  under  this  stricture  except  inso- 
far as  we  presume  the  radix  "10,"  and  they  are  a  paradigm  then  of 
both  sorting  and  filing.)   In  an  arbitrary  sorting-order  there  is  abso- 
lute predictability  of  the  location  of  "q"  between  "p"  and  "r."    The 
seriality  of  the  whole  alphabet  (as  governed  by  the  Anglo-Saxon  "arbi- 
tration") makes  it  possible  to  go  from  any  entry-point  in  the  series 
to  the  desired  point:    if  one  enters  at  "k",  one  knows  that  he  must  go 
"downstream"  about  half-way  to  the  end  of  the  series  to  hit  "q." 

In  a  file  there  is  seriality  too;  but  absolute  predictability  of  the 
kind  just  mentioned  requires  an  equally  stringent  application  of  arbi- 
trary rules  and  an  equally  absolute  awareness  of  the  precise  form  of 
the  sought  sign- complex.    It  is  the  task  of  the  syndetic  apparatus  to 
furnish  the  searcher  with  this  absolute  awareness  because  of  the 
vagaries  that  seem  inevitable  in  the  cataloger' s  choice  and  rendering 
of  entries.    The  seeker  for  "John  Chrysostom,  Saint"  may  well  not  be 
able  to  predict  the  location  of  such  an  entry  precisely  because  he  is 
aware  of  the  fact  that  "Chrysostom"  is  a  sobriquet,  while  the  cataloger 
or  filer  is  not. 


A  Conclusion  of  a  Kind 


It  is  easy  enough  to  pick  out  flaws  in  any  system,  and  far  more 
so  in  any  body  of  rules  such  as  we  are  familiar  with  for  filing,  be- 
cause none  of  them  is  really  a  system.    I  am  not  unsympathetic  to  the 
various  attempts  to  provide  the  means  for  ordering  a  file  of  complex 
verbalisms.    I  am,  however,  convinced  that  only  to  those  who  look 
the  difficulties  full  in  the  face  and  recognize  the  primordia  of  the 
concept  of  order  in  its  various  manifestations  can  there  come  hope 
of  being  able  to  devise  or  even  to  accept  improvements. 

The  need  for  predictability  is  paramount,  but  there  is  not  just 
one  predictability  to  be  sought  for.    In  terms  of  the  bifurcation 
"eidetic/ symbolic"  these  insights  appear:    (a)  symbolic  order  makes 
possible  the  predictability  of  entries  known  in  precise  detail,  where- 
as (b)  eidetic  order  makes  possible  the  predictability  of  entries 
known  in  terms  of  their  meaning.    For  instance,  entries  beginning 
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"M,"  "Mac,"  and  "Me,"  either  in  a  separate  alphabetic  sector  before 
or  after  all  other  "M.  .  ."  entries  or  in  a  sub- sector  of  "M.  .  . "  (as 
if  all  spelled  "Mac"),  make  possible  finding  entries  like  "McClare," 
"MacClennan,"  and  "M'Clore"  in  that  order,  without  the  searcher 
needing  to  advert  to  the  precise  form  of  the  prefix.    This  is  an  ex- 
ample of  the  advantage  of  eidetic  order.    One  advantage  of  the  sym- 
bolic order  is  that  the  searcher  for  "Art,  Byzantine"  could  afford  to 
ignore  the  membership  of  this  heading  in  one  of  the  sub -alphabets. 
Instead,  he  could  be  assured  that  all  comma- subdivisions  of  any 
heading  like  "Art"  would  be  in  a  predictable  place,  distinct  from 
double-dash-subdivisions,  full- stop- subdivisions,  parenthesis- 
subdivisions,  or  any  other  such. 

Can  we  afford  to  sacrifice  one  such  advantage  to  gain  the  other  ? 
I  do  not  believe  so;  and  I  believe  that  a  combination  of  two  of  the 
techniques  outlined  earlier,  namely  the  collating  sequence  and  the 
"programming"  of  the  cataloger,  can  effect  the  solution  desired  with- 
out sacrifice  of  either  advantage.    These  elements  of  a  putatively 
adequate  filing  system  are  both  indispensable  anyway,  since  all  filing 
implies  (a)  ritually  sortable  elements  to  constitute  its  filed  complex- 
es, and  it  is  the  collating  sequence  that  makes  such  ritual  sorting 
possible.    It  also  implies  (b)  semantic  content  to  these  complexes, 
and  the  complementary  control  of  them  by  the  application  of  biblio- 
graphical control  by  the  cataloger. 20   if  these  elements  are  combined 
with  foresight,  there  can  arise  these  two  distinct  and  important  re- 
sults:   (a)  an  eidetically  determined  filing  order  symbolically  sort- 
able,  and  (b)  a  flexible  computer -contained  surrogate  capable  of 
serving  as  a  basis  for  a  wide  variety  of  output  purposes,  21  both  ritu- 
ally manipulable.    By  such  a  combination  of  devices,  which  would  im- 
ply the  tagging  of  a  carefully  determined  totality  of  the  sub-elements 
of  the  whole  complex  of  descriptive  elements,  there  could  be  brought 
about  any  order  desired,  as  long  as  such  a  desired  order  was  based 
on  ritually  consistent  principles,  and  any  degree  of  abbreviation  of 
the  entry  as  a  whole.    The  main  point  to  grasp  here  is  that  the  greater 
the  number  of  uniquely  tagged  sectors,  the  greater  the  flexibility  for 
both  these  purposes.    These  sectors  will  not  be  easy  to  enumerate, 
nor  will  it  be  a  simple  matter  to  devise  unique  codes  for  them.    It  is 
probably  possible  only  with  the  use  of  mutually  modifying  codes. 

For  instance,  the  combination  of  eidetic  and  symbolic  order, 
through  the  conscious  cooperation  of  the  cataloger  and  the  collating 
sequence,  could  give  this  order  to  the  sub-alphabets  of  "Art":    "Art 
—Congresses"  and  other  forms,  "Art:    London"  and  other  places  in 
which.  .  .,  "Art  'Gothic'"  and  other  styles  etc.,  "Art;  Byzantine"  and 
other  places  of  which.  .  .,  "Art/1901-     "  (=20th  century  modern)  and 
other  periods,  "Art  and  .  .  .,  Art  in  .  .  ."  and  other  uninverted  phras- 
es, and  finally,  though  not  given  (nor  needed  here)  in  Subject  Headings, 
"Art  (Qualified)."    (In  such  an  arrangement  the  punctuation  used  must 


63 

not  be  allowed  to  be  used  for  any  other  purpose,  at  least  not  in  subject- 
headings.    This  is  what  I  mean  by  mutually  modifying  codes.)    Thus 
the  heading  "Art,  Byzantine"  would  either  become  "Art;  Byzantine," 
with  see -references  from  "Art  'Byzantine'"  and  "Art/[Byzantine  peri- 
od]" and  (perhaps)  "Art:    Byzantium"— to  force  all  usages  of  the  in- 
verted phrase  to  be  located  at  the  putatively  most  helpful  place,  or 
there  could  be  entries  allowed  under  each  of  the  three,  depending 
upon  the  emphasis  of  the  document  being  cataloged  (ethnic/ geographic, 
stylistic,  or  temporal),  with  see-also  references  connecting  them. 

The  steps  to  such  a  solution  must  include  (a)  comparative  analy- 
sis of  the  various  eidetic  and  symbolic  orders  advocated,  in  the  light 
of  basic  principles  such  as  have  been  enunciated  here,  (b)  tabulation 
of  the  sectorings  necessary  to  achieve  all  these  orders— insofar  as 
they  can  be  seen  to  be  truly  desirable  in  terms  Of  the  real  problems 
of  searching— from  a  single  input-record,  and  (c)  checking  to  see  wheth- 
er additional  sectoring  is  necessary  to  enable  selective  abridgements. 
This  ambitious  program  does  not,  of  course,  mean  that  we  cannot 
achieve  improvements  right  now.    Already,  in  the  design  of  various 
computerized  cataloging  systems  (as  at  Florida  Atlantic  University,  22 
the  Ontario  New  Universities  Library  Project, 23  the  University  of 
California  at  Santa  Cruz, 24  and  the  medical  libraries'  union  catalog 
at  Yale,  Harvard,  and  Columbia  Universities25)>  various  systems  of 
use  of  computers  in  filing  have  been  worked  out.    These  may  have 
been  less  than  ideal  solutions,  but  if  they  were,  it  is  primarily  be- 
cause of  the  ad  hoc  nature  of  the  attempts.    They  were  not  worked  out 
as  consciously  partial  solutions  of  the  problem  as  a  whole.    The  com- 
bination of  a  careful  re -examination  of  our  prevalent  attitudes  on 
preferred  eidetic  order(s),  of  the  introduction,  where  necessary,  of 
flexibility  in  the  collating  sequence,  of  the  development  of  the  codifi- 
cations which  can  make  cataloging  rules  bear  along  with  them  all  that 
is  needed  for  the  accomplishment  of  the  two  central  purposes— this 
is  what  ultimately  must  be  sought,  even  when  we  wish  to  make  only 
small  practical  improvements. 
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Toronto  Press,  1962,  pp.  159-161. 

"Applied  knowledge  in  the  Renaissance  had  to  take  the 
form  of  translation  of  the  auditory  into  visual  terms,  of  the  plastic 
into  retinal  form;"  note  in  particular  his  quotation  from  Father  Ong 
(p.  160):    "The  spatial  processing  of  sound  by  means  of  the  alphabet 
is  not  enough.    Printed  or  written  words  themselves  must  be  deployed 
in  spatial  relationships,  and  the  resulting  schemata  thought  of  as  a 
key  to  their  meanings." 

8.  Hjelmslev,  Louis.    "Prolegomena  to  a  Theory  of  Language," 
Translated  by  Francis  J.  Whitfield  In  Indiana  University  Publications 
in  Anthropology  and  Linguistics,  Vols.  1-10  (1948-54):    Memoir  7, 
pp.  29-38. 

He  points  out  here  the  non-cotermineity  of  the  individual 
members  of  two  groups  (in  English  and  Welsh)  of  words— and  again, 
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9.  Ceccato,  Silvio,  et  al.    Linguistic  Analysis  and  Program- 
ming for  Mechanical  Translation.   New  York,  Gordon  and  Breach, 
1961,  p.  59  ff. 

10.    Lubetzky,  Seymour.    Letter  dated  Oct.  13,  1964. 

This  is  forcefully  pointed  out  (in  answer  to  my  paper 
cited  in  reference  4),  in  a  passage  whose  import  I  have  attempted  to 
integrate  with  my  own  prevailing  conviction  of  the  supremacy  of  the 
collating  sequence  (i.e.  of  sorting  as  the  ideal  ritual  to  make  possible 
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computer -filing);  he  argues  that  "the  basic  difficulty  lies  in  our  use 
of  the  same  symbols— the  dash  or  the  comma— for  various  types  of 
subdivisions,  and  I  believe  that  the  development  and  use  of  a  system 
of  symbols  to  designate  and  distinguish  the  different  types  of  sub- 
division would  add  a  new  dimension  to  the  organization  of  the  catalog 
and  contribute  materially  to  its  intelligibility  and  effectiveness." 
This  hope  stems  from  his  conviction  that  "to  theorize  that  filing  is 
implicitly  alphabetization,  not  classification,  is  quite  erroneous  and 
misleading."    I  am  not  willing  to  go  that  far,  at  least  terminologically, 
but  the  same  insight  is  stressed  here  that  motivated  Ranganathan's 
statement  quoted  at  the  beginning  of  this  paper. 

11.  The  Prussian  Instructions;  Rules  for  the  Alphabetical  Cata- 
logs of  the  Prussian  Libraries.    Translated  from  the  Second  Edition, 
Authorized  August  10,  1908,  with  an  Introduction  and  Notes  by  Andrew 
D.  Osborn,  Ann  Arbor,  U.  of  Michigan  Press,  1938,  pp.  27-106 

et  passim. 

12.  Ranganathan,  op.  cit.,  pp.  59-63. 

13.  Fugmann,  Robert.    "Ordnung— oberstes  Gebot  in  der  Doku- 
mentation,"  Nachrichten  fiir  Dokumentation,  13:121,  Number  3,  Sept. 
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14.  Needham,  Christopher  D.    Organizing  Knowledge  in  Li- 
braries.   London,  Andre  Deutsch,  1964,  pp.  181-183. 

15.  According  to  an  unpublished  study  made  by  Barbara  Marku- 
son  of  the  Information  Systems  Specialist's  Office  of  the  Library  of 
Congress,  47  percent  of  the  authors  examined  had  only  one  entry,  61 
percent  one  or  two,  66  percent  one,  two,  or  three;  thus  the  hope  for 
consistent  return  of  author  entries  may  be  disappointed  as  well. 

16.  Cf.  e.g.  Srygley,  Ted  F.    "Serials  Record  Instructions  for 
a  Computerized  Serial  System,"  Library  Resources  &  Technical  Ser- 
vices, 8:248-256,  Summer  1964.  ~ 

Schultheiss,  Louis  A.  "Two  Serial  Control  Card  Files  De- 
veloped at  the  University  of  Illinois,  Chicago,"  Library  Resources  & 
Technical  Services,  9:271-287,  Summer  1965. 

Stewart,  Bruce  Warren.    A  Computerized  Serials  Record 
System  for  the  Texas  A  &  M  University  Library.    College  Station, 
Texas  A  &  M  University  Library,  1965. 

17.  Perreault,  Jean  M.    "Documentary  Relevance  and  Structur- 
al Hierarchy,"  Information  Storage  and  Retrieval,  3:13-18,  Number  1, 
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18.  Gore,  Daniel.    "Encounter  with  a  Cataloger,"  Library  Re- 
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22.  See:    Heiliger,  Edward  M.,  et  al.  "Florida  Atlantic  Univer- 
sity. "  ^olle^e_&^e^e^r^hjdib^aries,  25:181-199,  Number  3,  1964; 
and  Perreault,  Jean  M.    "Computerized  Cataloging:    The  Computer- 
ized Catalog  at  Florida  Atlantic  University,"  Library  Resources  & 
Technical  Services,  9:20-24,  Number  1,  1965~ 

23.  See:    Bregzis,  Ritvars.    "Compatible  Format  for  Machine 
Readable  Record  of  Bibliographic  Data,"  paper  read  at  the  1965  Con- 
gress of  the  FID;  Proceedings  of  the  Congress  to  be  published  mid- 
1966;  and  also  his  "The  Ontario  New  Universities  Library  Project- 
an  Automated  Bibliographic  Data  Control  System,"  College  &  Re- 
search Libraries.  26:495-508,  Number  6,  1965. 

24.  See:    Computer  Usage  Company.    Specification  for  an  Auto- 
mated Library  System.    Palo  Alto,  1965. 

25.  See:    Kilgour,  Frederick  G.    "Development  of  Computeriz- 
ation of  Card  Catalogs  in  Medical  and  Scientific  Libraries."   In  Herbert 
Goldhor,  ed.,  Proceedings  of  the  1964  Clinic  on  Library  Applications 

of  Data  Processing.  Urbana,  Illinois,  University  of  Illinois  Graduate 
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Appendix  I 


A  list  of  subject  heading  uses  of  "John.  .  .,"  from:    Library  of  Con- 
gress Catalog;  (Books:    Subjects)  a  Cumulative  List  of  Works  Repre- 
sented by  Librarv~of  Congress  Printed  Cards.  1955-1959.  11:    580- 
581.    Patterson,  New  Jersey,  Pageant  Books,  Inc.,  1960. 


JOHN  BERCHMANS,  SAINT 

JOHN  BOSCO,   SAINT 

JOHN  CAPISTRAN,  SAINT 

JOHN  CLIMACUS 

JOHN  FISHER,  SAINT 

JOHN  FREE 

JOHN,  KING  OF  ENGLAND 

JOHN  OF  GAUNT,  DUKE  OF 
LANCASTER 

JOHN  OF  JANDUN 

JOHN  OF  KRONSTADT,  FATHER 

JOHN  OF  LEYDEN 

JOHN  OF  ROQUETAILLADE 

JOHN  OF  SALISBURY,  BP.  OF 

CHARTRES 

JOHN  OF  THE  CROSS,  SAINT 
JOHN  XXHI,  POPE 
JOHN,  PRESTER 
JOHN,  SAINT,  APOSTLE 
JOHN  THE  BAPTIST 
JOHN  VIANNEY,  SAINT 
JOHN,  OTTO 


JOHN  (BOOK  OF  THE  NEW 
TESTAMENT) 

JOHN,  EPISTLES  OF  (BOOK  OF 
THE  NEW  TESTAMENT) 

JOHN  BROWN  RAID,  1859 

JOHN  CRERAR  LIBRARY, 
CHICAGO 

JOHN  DICKINSON  AND  COM- 
PANY, LTD. 

JOHN  GOFFE'S  MILL,  BEDFORD, 
N.  H. 

JOHN  HANCOCK  BUILDING, 
SAN  FRANCISCO 

JOHN  LEWIS  PARTNERSHIP,  LTD. 

JOHN  MORE  ASSOCIATION 

JOHN  MUIR  TRAIL,  SIERRA 
NEVADA  MOUNTAINS 

JOHN  NEWBERY  MEDAL  BOOKS 

JOHN  RYLANDS  LIBRARY,  MAN- 
CHESTER 

JOHN  THE  BAPTIST'S  DAY 

JOHN  WOODMAN  HIGGINS  AR- 
MORY, WORCESTER,  MASS. 
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A  list  of  author  uses  of  "John.  .  .,"  from:    A  Catalog  of  Books  Repre- 
sented by  Library  of  Congress  Printed  Cards.  Issued  to  July  31. 
1942>  76:616-621.    Ann  Arbor,  Michigan,  Edwards  Brothers,  Incorpo- 
rated, 1944. 


John  III,  King  of  Portugal 
John,  abbot  of  Ford 
John  (bar  manager) 
John,  bp.  of  Lincoln 
John,  bp.  of  Nikiu 
John,  Brother 


John  de  Bordeaux 
John  de  Britto 
John  de  Halton 

John  de  Pontissara,  bp.  of  Win- 
chester 
John  de  Sandale,  bp.  of  Winchester 
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John,  de  Taxster 

John  of  Burgundy 

John  of  Cappadocia 

John,  of  Damascus,  Saint 

John  of  Gaddesden 

John,  of  Garland 

John  of  Gaza 

John  of  Genoa 

John  of  Halton,  bp.  of  Carlisle 

John,  of  Hildesheim 

John  of  Hoveden 

John  of  Ire 'land 

John  of  Jandun 

John  of  Lancaster,  pseud. 

John,  of  Nepomuk,  Saint 

John  of  Pian 

John  de  Carpine 

John  of  Ravenna 

John  of  Salisbury,  bp.  of  Chartres 

John,  of  Tynemouth 

John  of  Wales 

John  of  Worcester 

John,  Prester 

John,  son  of  Sabanis 

John,  the  Evangelist 

John  the  Faster 

John,  the  Fearless,  duke  of 

Burgundy 
John,  the  Painter 


John  Wallensis 

John  Alcober 

John  Ambrose,  father 

John  Amundesham 

John  Baptist,  archduke  of  Austria 

John  Boston,  of  Bury 

John  Chrysostom,  Saint 

John  de  Burgh 

John  de  Oxenedes 

John  Francis  Regis,  Saint 

John  Gabriel,  sister 

John  Gualbert 

John  Henry,  pseud. 

John  Joseph  Laurence  Charles 

Louis,  archduke  of  Austria 
John  Mary  of  Reifenberg,  father 
John  o' London,  pseud. 
John  of  God 

John  of  Gaunt,  duke  of  Lancaster 
John  of  Saint  Thomas 
John  Paul,  pseud. 
John  Sackland 
John  III.    Sobieski,  King  of 

Poland 

John  Zapolya,  King  of  Hungary 
John,  Alfred 


Appendix  III 


Inverted  Citation  Order 


In  order  to  insure  that  whole  comes  before  part,  general  before 
specific,  the  principle  of  citation  order  in  classification  is  subjected 
to  inversion.    Citation  order  results  in  predictability  among  the  indi- 
vidual concepts  ascribed  to  a  single  document;  inversion  of  citation 
order  results  in  predictability  among  the  complexes  (i.e.,  the  docu- 
ments themselves  or  their  surrogates). 

Two  techniques  are  displayed  below,  either  of  which  can  assure 
proper  inversion.    There  must,  however,  be  a  decision  ahead  of  time 
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as  to  the  citation  order  upon  which  the  inversions  are  to  be  based. 
Or,  if  there  seems  no  clear  preference  as  to  citation  order,  filing 
order  can  be  decided  first,  and  the  citation  order  can  be  derived  as 
its  inversion.    In  practice  the  two  lines  of  attack  may  be  exercised 
simultaneously. 

TECHNIQUE  A.    With  postulated  facets  "place,"  "period,"  and 
"form,"  of  a  subject  Y,  we  can  construct  a  matrix  showing  citation 
order,  and  then  deduce  all  the  patterns  of  occurrence: 


'place"       "period"       "form" 


Y 
Y 
Y 
Y 
Y 
Y 
Y 

X 

(do 

(c) 
(d) 
(e) 
(f) 
fe) 

X 

X 

X 

X 

X 

X 

X 

X 

X 

X 

X 

(document  a) 


The  occurrences  are  in  terms  of  foci  from  each  facet,  of 
course.    For  instance,  if  Y  is  "architecture, "document  (a)  could  be  a 
bibliography  of  architecture,  (b)  a  study  of  architecture  in  the  20th 
century,  (c)  a  bibliography  of  architecture  in  the  20th  century,  (d)  a 
study  of  the  architecture  of  France,  (e)  a  bibliography  of  architecture 
in  France,  (f)  a  study  of  20th  century  architecture  in  France,  and 
(g)  a  bibliography  of  20th  century  architecture  in  France. 

TECHNIQUE  B.    Alternatively,  the  citation  order  can  be  ex- 
pressed as  an  (ad  hoc)  weighting  device.    If  the  facet  "place"  is  to 
come  first  in  the  citation  order,  it  should  be  accorded  the  greatest 
weight,  "period"  the  next  greatest,  "form"  the  least,  and  empty  facets 
zero.    Thus  "place"  =  3,  "period"  =  2,  and  "form"  =  1,  so  that  docu- 
ment (a)  receives  the  weight  001,  (b)  020,  (c)  021,  (d)  300,  (e)  301, 
(f)  320,  and  (g)  321,  putting  them  into  the  same  order  as  before. 

These  techniques  are  appropriate  not  only  to  subject  classifi- 
cation, but  to  the  cataloging  code  as  well.    For  instance,  facet  analy- 
sis is  implicitly  present  in  the  construction  of  anonymous- classic 
headings  such  as: 

Bible.  English.    Goodspeed. 

Bible.  English.    1964. 

Bible.  N.T.    English.    1964. 

Bible.  N.T.    Galatians.    English.    Goodspeed. 

Bible.  N.T.    Galatians.    English.    1964. 

Bible.  N.T.    Galatians.    1964.    Goodspeed. 
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'part"      'book"      "language"      "date"      "version' 


X 

X 

X 

X 

X 

X 

X 

X 

X 

X 

X 

X 

X 

X 

X 

X 

X 

X 

X 

X 

The  basic  principle  operating  here  is  "nothing  before  some- 
thing":   the  unspecified  before  the  specified,  the  whole  before  the 
part,  or  the  general  before  the  specific.    The  desired  effect  is  pre- 
dictability—here, of  finding  the  desired  document  or  surrogate  (while 
citation  order  makes  the  analysis  of  the  document  and  the  creation  of 
the  surrogate  predictable). 


Appendix  IV 


Compression  of  Titles 


A  sample  of  titles  was  chosen  from  the  Catalog  of  Books  Repre- 
sented by  Library  of  Congress  Printed  Cards  .  .  .  1942,  according  to 
the  following  rule:    the  first  English  title  in  the  third  column  or  the 
next  following  page  on  which  an  English  title  appears,  of  pp.  100,  200, 
300,  etc.  of  vol.  1,  the  same  from  pp.  101,  201,  301,  etc.  of  vol.  3,  pro- 
gressing similarly  through  each  odd-numbered  volume  up  to  vol.  23; 
whereafter  it  was  re-cycled  from  p.  100  of  vol.  25,  101  of  vol.  30, 
etc.    This  resulted  in  a  random  sample  of  235  titles.    English  was  the 
only  language  accepted,  not  because  there  is  no  problem  with  foreign 
titles,  but  because  the  probability  of  coming  to  a  clear  understanding 
of  basic  principles  is  higher  under  such  restrictions. 

Two  simultaneous  tabulations  were  made: 

(A)  Of  types  of  main  entry  encountered,  which  yielded  11  title 
main  entries,  10  corporate  main  entries,  and  214  personal  main  en- 
tries.   The  smallness  of  the  proportion  of  entries  not  of  the  "normal" 
pattern,  i.e.,  of  a  title  distinctly  subordinate  to  a  personal  responsi- 
bility, is  significant,  since  the  form  of  title  in  such  "normal"  cases 
is  not  as  crucial  in  the  finding  operation  as  in  corporate  main  entries 
or  in  title  main  entries.    (It  was  not  tabulated  whether  some  of  the 
"odd"  21  might,  by  a  different  decision,  have  been  considered  person- 
al main  entries,  nor  whether  any  of  the  "normal"  214  might  have  been 
considered  "odd"  by  a  different  code  for  choice  of  entry.)    It  is  per- 
haps interesting  to  note  that  none  of  the  other  main  classes  of  main 
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entry  occurred,  namely  anonymous  classics  or  uniform  headings. 

(B)  Of  the  structure  of  the  titles  themselves  in  terms  of  ad- 
ditional verbiage  beyond  the  "core "-title,  which  yielded 
1  alternative  title  as  expansion  of  acronym, 

1  sectional  title, 

2  mentions  of  contents, 

4  initial  words  or  phrases  to  be  arbitrarily  omitted  as  in- 

significant, 

5  mentions  of  redaction, 
7  initial  possessives, 

9  alternative  titles  introduced  by  "or," 
9  supplementary  sub -titles, 
11  mentions  of  occasion, 
24  form  sub- titles, 
30  explanatory  sub-titles, 
39  mentions  of  scope,  and 
91  initial  non-filing  articles. 

And,  where  a  new  class  can  be  defined  by  the  simultaneous  member- 
ship of  one  title  in  two  or  more  of  the  above  in  the  same  sector  of  the 
title,  the  tabulation  yielded 

1  explanatory/ form/ supplementary  sub-title, 

1  mention  of  occasion/explanatory  sub-title, 

2  alternative/explanatory  sub-titles, 

2  mentions  of  occasion/ form  sub-titles, 
2  mentions  of  redaction/supplementary  sub-titles, 
4  mentions  of  scope/explanatory  sub-titles,  and 
4  explanatory/form  sub-titles. 

The  significant  judgment  to  be  made  on  these  structural  types 
is  one  of  these  three:    (a)  Is  there  a  sector  other  than  the  "core"- 
title  that  should  also  be  traced?    (b)  Is  there  a  sector  other  than  the 
"core" -title  that  should  be  omitted  altogether  at  the  time  of  com- 
pression?   (c)  Is  there  a  sector  other  than  the  "core" -title  that  is 
necessary  for  the  intelligibility  of  the  "core "-title,  but  which  should 
not  be  re-traced  even  when  retained?    Those  classes  that  satisfy  (a) 
are  mentions  of  contents,  sectional  titles,  and  alternative  sub-titles; 
those  that  satisfy  (b)  are  explanatory  and  supplementary  sub-titles, 
as  well  as  some  mentions  of  scope  and  some  form  sub-titles;  those 
that  satisfy  (c)  are  mentions  of  redaction  and  the  residual  mentions 
of  scope  and  form  sub-titles.    This,  in  terms  of  the  sample,  yields  12 
sectors  to  be  re-traced,  til. 5  to  be  dropped,  and  ±37.5  to  be  retained 
but  not  retraced.    The  remaining  titles  did  not  have  such  sectors, 
consisting  of  "core" -title  alone. 

Can  the  computer  abbreviate  titles  according  to  these  criteria? 
It  might  be  that  it  could  do  so  if  standardized  catalog- entry  data  were 
such  as  invariably  to  indicate  these  various  functions— but  the  lack  of 
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precision  as  to  the  various  types  of  non-"core"  sectors  of  titles,  out- 
side this  investigation,  makes  it  a  priori  impossible  for  such  an  indi- 
cation to  be  expected.    Something  close  to  such  an  indication  is  im- 
plied by  the  practice  of  introducing  alternative  title  in  "or"  by  the 
invariable  punctuation  ";  or,".  But  examination  of  the  titles  in  the 
sample  leads  to  near-total  lack  of  faith  as  far  as  any  expectation  of 
such  consistency  from  LC  cataloging  is  concerned.    (It  can,  of  course, 
be  argued  that  this  inconsistency  is  traceable  back  to  the  title-pages 
themselves,  but  any  cataloger  can  easily  dispute  such  an  allegation, 
there  being  all  sorts  of  arbitrary  re -punctuations  to  be  found  in  LC 
cataloging— although  not  for  the  sake  of  the  establishment  of  any 
function  such  as  automatic  filing/tracing/ abbreviation  decisions.) 

It  is  therefore  recommended  that  four  punctuation  symbols  be 
used  in  titles  for  purposes  of  functional  sectoring,  and  that  they  not 
be  allowed  to  be  used  in  their  common  senses.  They  are: 

(a)  the  colon  (:)  to  indicate  the  re-tracing  of  mentions  of  con- 
tent and  of  sectional  titles, 

(b)  parentheses  (())  to  indicate  the  re-tracing  of  alternative 
subtitles, 

(c)  the  semi- colon  (;)  to  indicate  that  the  subsequent  data  are 
not  to  be  retraced  or  even  retained  at  the  time  of  compression,  and 

(d)  the  double-dash  ( — )  to  indicate  the  retention  of  the  subse- 
quent data,  without  re- tracing  it. 

One  further  symbol  is  to  be  used  in  a  restricted  way,  the  virgule  (/); 
see  below.    This  leaves  enough  punctuation  symbols  with  their  com- 
mon usages  (.  ,  ?  "  '  -  *  &)  to  avoid  problems  in  the  ordinary  gram- 
matical organization  of  the  entry. 

There  is  a  large  number  of  words  which  recur  often  enough  to 
justify  their  invariable  abbreviation,  at  least  at  the  time  of  compres- 
sion.   The  sample  from  which  the  following  list  is  derived  is  certainly 
not  large  enough  to  lead  us  to  think  that  they  form  an  exhaustive  tabu- 
lation, but  they  do  form  a  sort  of  nucleus: 


Amer(ican) 

b(oo)k 

coll(ege) 

collec(ted)  (tion) 

compl(ete) 

conc(erning) 

considered ) 

dev(elopment) 

diss(ertation) 

e(ast) 

ed(ited)  (itor)  (ition) 

educ(ation) 

Eng(lish) 


ess(ential) 
extr(act)  (acted) 
gen(eral) 
G(rea)t  Br(itain) 
h(an)db(oo)k 
interpr(etation) 
intro(duction)  (duc- 

tory) 
iss(ue) 
libr(ary) 
man(ual) 
mo  (nth) 
n(orth) 
org(anization) 


p(ar)t 

pol(itical) 

pr(inted)  (inting) 

published )  (ication) 

repr(inted)  (inting) 

sect(ion) 

sel(ected)  (ection) 

s(outh) 

tr(anslated)  (anslator)  (ans- 

lation) 
univ(ersity) 
U(nited)  S(tates) 
w(est) 
y(ea)r 
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The  standard  abbreviations  for  the  days  of  the  week,  the  months 
of  the  year,  the  states  of  the  Union  should  be  used  plus  those  for 
form-divisions  given  in  Subject  Headings. 

Such  words  are  not  as  frequent  of  repetition  as  are  certain 
prepositional-phrase-introducers  (which,  while  they  do  not  each  take 
up  so  much  space,  do  so  in  the  aggregate),  such  as: 

as  regards  (a,  an,  the) 
at  (a,  an,  the) 
by  (a,  an,  the) 
for  (a,  an,  the) 
from  (a,  an,  the) 
in  (a,  an,  the) 
of  (a,  an,  the) 
to  (a,  an,  the) 
with  (a,  an,  the). 

It  is  recommended  that  these  phrase -introducers,  together  with  the 
article  following  them,  be  programmed  to  be  reduced  to  the  virgule 
at  the  time  of  compression. 

There  are  accordingly  three  steps  in  compression  of  titles,  and 
they  can  be  most  satisfactorily  accomplished  in  two  stages  in  each  of 
two  possible  ways: 

(either)  (or) 

(a,b)   functional  punctuation  insert-  (a)       functional  punctuation 

ed  by  the  cataloger;  abbrevi-  inserted  by  the  cataloger 

ation  of  frequent  words  by  (b,c)   programmed  abbrevia- 
the  cataloger  tion  of  frequent  words; 

(c)       programmed  excision  of  programmed  excision  of 

phrase- introducers  (='/')  phrase- introducers  (='/') 

These  techniques,  when  applied  to  the  235-title  sample,  result 
in  compression  of  all  titles  to  40  or  fewer  characters  except  in  19 
cases;  the  compressed  titles  thus  truncated  are  somewhat  less  than 
intelligible  in  all  but  8  cases.    It  seems  unlikely  that  any  further 
compression  can  be  achieved  except  in  terms  of  further  truncation 
and  of  a  direct  increase  in  the  number  of  unintelligibilities  that  re- 
sult—except by  the  use  of  a  longer  list  of  frequent  words  to  abbrevi- 
ate. 

The  final  problem  is  that  of  the  elimination  of  words  and  phras- 
es standing  at  the  head  of  the  title,  rather  than  being  sub-sectors 
coming  after  the  "core" -title.    There  are  three  types  of  such  initial- 
isms  to  be  eliminated:    simple  initial  articles,  initial  possessives  re- 
ferring to  the  main  entry,  and  insignificant  phrases  of  introduction. 
Initial  articles  (in  English)  can  be  quite  easily  programmed  out  of  the 
filing  order  in  the  same  computer-pass  which  reduces  phrase- 
introducers  to  virgules;  however,  the  total  elimination  of  such 
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articles  in  the  compressed  print-out  is  not  recommended  except 
where  it  is  noticed  (by  program)  that  such  retention  will  result  in 
truncation  of  sectors  neither  re-traced  nor  to  be  eliminated,  namely 
if  the  title  is  sectored  by  a  double-dash  or  is  not  sectored  at  all.    Of 
the  19  titles  truncated  at  character  41,  12  began  with  articles,  and 
the  application  of  the  ritual  questions  here  recommended  would  have 
mollified  the  resultant  unintelligibility  by  dropping  the  initial  articles 
in  11  of  those  cases. 

Initial  possessives  should  always  be  eliminated  from  the  very 
first  if  they  refer  to  the  main  entry;  there  is  only  one  case  in  the 
sample  that  does  not  reproduce  the  main  entry,  and  must  accordingly 
not  be  dropped.    This  "exception"  occurs  not  so  seldom  in  the  real 
world,  especially  in  humanistic  and  historical  studies  (e.g.,  "Aris- 
totle's Contribution  to  .  .  .,"  by  Barker,  Mansion,  or  the  like). 

The  4  arbitrarily  excised  initialisms  (type  c)  are  characteris- 
tically equivalent  to  form  sub-titles,  but  their  position  causes  trouble 
in  that  the  same  symbol  cannot  be  used  to  exclude  them  that  was  used 
for  terminal  exclusion  of  form  sub-titles,  namely  the  semi-colon. 
Accordingly,  it  is  recommended  that  such  initialisms,  when  recog- 
nized by  the  cataloger  as  accompanying  a  title  that  will  result  in  be- 
ing truncated  by  compression,  should  be  excised  from  the  very  first, 
so  as  not  to  appear  in  the  full  title  any  more  than  in  the  compressed. 
Perhaps,  as  a  concession,  the  brief  lacuna- symbol  (..)  could  be  used 
to  indicate  such  excision. 

The  titles  that  make  up  the  sample  from  which  these  conclusions 
are  drawn,  first  in  raw  form  (with  indication  of  structural  type  of 
sectoring)  and  then  as  compressed  (with  truncated  parts  underlined), 
are 

History  of  King  Richard  the  Second  of  England 
HIST. /KING  RICHARD  THE  2D/ENG. 

Elementary  mechanical  drawing,  for  school  and  shop  (explanatory/ 

scope) 
ELEMENTARY  MECHANICAL  DRAWING- -SCHOOL  +  SHOP 

The  Joseph  Leidy  commemorative  meeting,  held  in  Philadelphia, 

Dec.  6,  1923  (occas.) 
THE  JOSEPH  LEIDY  COMMEMORATIVE  MEETING;  Philadelphia, 

Dec.  6,  1923 

A  relation  of  the  great  river  of  Amazons  in  South -America  (scope) 
A  RELATION/GREAT  RIVER/ AMAZONS;  s.  amer. 

Provincial  society,  1690-1763  (scope) 

Address  to  the  people  of  Pennsylvania  (title  main  entry) 
ADDRESS/PEOPLE/PA. 
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War  against  war;  or,  The  joys  of  peace  (alternative) 
WAR  AGAINST  WAR  (the  joys  of  peace) 
THE  JOYS  OF  PEACE 

A  manual  of  instruction  in  Latin  on  the  basis  of  a  Latin  method.  .  . 

(explanatory) 
A  MAN./DSfSTRUCTION/LATIN;/basis/latin  method.  .  . 

The  Sumner  assault.    Speech  of.  .  .  (occasion,  form) 
THE  SUMNER  ASSAULT;  speech  of.  .  . 

Hungary  before,  during  and  after  the  great  war  (scope) 
HUNGARY- -BE FORE,  DURING  +  AFTER  THE  GREAT  WAR 

The  Amaranth;  a  gift  for  all  seasons  (title  main  entry,  supplementary) 
THE  AMARANTH;  a  gift  for  all  seasons 

Bulletin  (corporate  main  entry) 
The  perfect  tribute 

Annable's  grain  calculator,  showing  at  a  glance  the  number.  .  .  (init, 

explan. ) 
GRAIN  CALCULATOR;  showing/ glance  the  number.  .  . 

Queen  Elizabeth 
The  chace 

Arbor  day  planting  in  eastern  states  (title  main  entry,  scope) 
ARBOR  DAY  PLANTING- -EASTERN  STATES 

The  Arena  (title  main  entry) 

Report  of  audit  (corporate  main  entry) 

Farm  experiments  with  fertilizers 
FARM  EXPERIMENTS/FERTILIZERS 

Guide  to  the  material  on  Swedish  history  in  the  Augustana  college 

library  (scope) 
GUIDE/MATERIAL/SWEDISH  HIST. —AUGUSTANA  COLL.  LIBR. 

Adelaide 

Consular  and  other  conventions  between  Austria -Hungary  and  Servia 

(initialism) 
CONVENTIONS/ AUSTRIA-HUNGARY  +  SERVIA 

Translation  of  the  Mining  law  of  Mexico,  effective  August  1,  1926.  .  . 

(init.,  occas.) 
MINING  LAW/MEXICO;  effective.  .  . 

Progress  in  the  science  of  education  in  the  last  twenty-five  years 

(scope) 
PROGRESS/SCIENCE/EDUC.--LAST  25  YRS. 
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A  collection  of  rare  and  valuable  books 

A  COLLECTION/ RARE  +  VALUABLE  BKS.    ('collection'  not  abbrev., 
because  first  word) 

Q39 

The  Barbers'  journal  (title  main  entry) 

An  introduction  to  the  Italian  language 
AN  INTRO. /IT ALLAN  LANGUAGE 

Marti,  a  story  of  the  Cuban  war  (explanatory/form) 
MARTI- -A  STORY/CUBAN  WAR 

A  brief  sketch  of  the  history  of  the  Catholic  church  in  the  island.  .  . 

(init,  scope) 
THE  CATHOLIC  CHURCH  IN  THE  ISLAND.  .  . 

A  sketch  of  the  doctrine  relative  to  commitments  in  bankruptcy 
THE  DOCTRINE  RELATIVE/COMMITMENTS  LN  BANKRUPTCY 

Sex,  marriage  and  motherhood 
SEX,  MARRIAGE  +  MOTHERHOOD 

Reconnaissance  of  the  gold  fields  of  southern  Alaska  with  some 

notes.  .  .  (suppl.) 
RECONNAISSANCE/GOLD  FIELDS/S.  ALASKA;/some  notes.  .  . 

The  eight  crooked  trenches 
All  in  it 

History  of  the  Forty-eighth  Ohio  vet.  vol.  inf.  giving  a  complete.  .  . 

(explan. ) 
HIST./48TH  OHIO  VET.  VOL.  INF.;  giving  a  compl.  .  .  . 

Labor  disputes  and  the  President  of  the  United  States 
LABOR  DISPUTES  +  THE  PRESIDENT/U.  S. 

The  fundamentals  of  naval  tactics 

THE  FUNDAMENTALS/NAVAL  TACTICS 

Joseph  King  Fenno  Mansfield 
Personal  and  literary  memorials 

Germany  before  the  war 
GERMANY- -BE FORE  THE  WAR 

Solar  hydrodynamics 

Commentaries  on  the  law  of  England.  .  .  the  19th  ed.  With  the  last 
corrections  of  the  author,  and  copious  notes.  By  J.  E.  Hoveden 
(redaction) 

COMMENT  ARIES/LA  W/ENGLAND--/NOTES/HOVENDEN 
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Co-operative  movement  in  Russia  (scope) 
CO-OPERATIVE  MOVE  ME  NT --RUSSIA 

The  roads  and  road  material  of  Indiana  (scope) 
THE  ROADS  +  ROAD  MATE  RIAL/ INDIAN  A 

Agricultural  climatology  of  the  United  States  compared  with  that  of 
other  parts  of  the  globe  (scope  embedded  in  'core') 

AGRICULTURAL  CLIMATOLOGY/ U.S.  COMPARED/OTHER  PARTS/ 
GLOBE 

Effects  of  various  dietary  deficiencies  upon  the  morphology  of  the 

suprarenal  gland 
EFFECTS/VARIOUS  DIETARY  DEFICIENCIES/MORPHOLOGY/ 

SUPRARENAL  GLAND  (or,  to  force  re -tracing  of  the  second 

sector:) 
EFFECTS/VARIOUS  DIETARY  DEFICIENCIES: MORPHOLOGY/ 

SUPRARENAL  GLAND 
MORPHOLOGY/SUPRARENAL  GLAND 

Psychology;  a  factual  textbook  (form) 
PSYCHOLOGY;  a  factual  textbook 

The  history  of  educational  legislation  in  Ohio  from  1851  to  1925 

(scope) 
THE  HIST./EDUC.  LEGISLATION- -OHIO,  1851-19^5 

Boston  investigator,  devoted  to  the  development  and  promotion  of  uni- 
versal mental  liberty  (explanatory,  title  main  entry) 
BOSTON  INVESTIGATOR;  devoted.  .  . 

The  art  of  dressmaking  at  home  and  in  the  workroom  (scope) 
THE  ART/DRESSMAKING- -/HOME  +/WORKROOM 

Canada  under  British  rule,  1760-1905  (scope) 
CANADA  UNDER  BRITISH  RULE- -1760- 1905 

A  history  of  Philadelphia,  with  notice  of  villages,  in  the  vicinity 

(supplementary) 
A  HIST. /PHILADELPHIA;  /  notices/villages,/vicinity 

The  drama  and  music  in  New  South  Wales  (scope) 
THE  DRAMA  +  MUSIC- -NEW  S.  WALES 

The  layman's  handbook  to  Daniel 
The  LAYMAN'S  HDBK. /DANIEL 

The  Jacobins;  an  essay  in  the  new  history  (form) 
THE  JACOBINS;  an  essay.  .  . 

New  reading  room  and  libraries 
NEW  READING  ROOM  +  LIBRS. 
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British  standard  specification  for  the  dimensions  of  grinding  wheels 

and  methods  of  attachment  (initialism) 
SPECIFICATION/DIMENSIONS/GRINDING  WHEELS  +  METHODS  OF 

ATTACHMENT 

Lectures  illustrative  of  various  subjects  in  pathology  and  surgery 
LECTURES  ILLUS. /VARIOUS  SUBJECTS/PATHOLOGY  +  SURGERY 

The  Senate  finance  bill 

Sanctuary,  Sunshine  house  sonnets  (form) 
SANCTUARY- -SUNSHINE  HOUSE  SONNETS 

Lands  of  the  Andes  and  the  desert 
LANDS/ ANDES  +  THE  DESERT 

Catalogue  of  the  library  and  prints  of.  .  .  of.  .  . 
CATALOGUE/LIBR.  +  PRINTS/.  .  . 

The  crimson  cutlass 

The  Pan-American  conferences  and  their  significance 
THE  PAN-AMER.  CONFERENCES  +  THEIR  SIGNIFICANCE 

Summer  tours  (corporate  main  entry) 

The  blessed  bees 

Moses  Greeley  Parker,  M.D. 

A  tentative  bibliography  of  the  belles-lettres  of  the  Argentine  republic 

(scope) 
A  TENTATIVE  BIBL./BELLES-LETTRES/ARGENTINE  REPUBLIC 

The  third  part  of  the  Institutes  of  the  laws  of  England:  concerning 
high  treason,  and  other  pleas  of  the  crown,  and  criminal  causes 
(sectional) 

THE  3RD  PT./INSTITUTES/LAWS/ENG.:    CONG.  HIGH  TREASON,  + 
OTHER  PLEAS/ CROWN 

CONCERNING  HIGH  TREASON,  +  OTHER  PLEAS/CROWN 

English  teaching  in  the  Southwest;  organization  and  materials  for  in- 
structing Spanish-speaking  children  (scope/explanatory) 

ENG.  TE ACHING- -S.W.;  org.  +  materials/instructing  spanish- 
speaking  children 

Christian  salvation,  a  modern  interpretation  (explanatory) 
CHRISTIAN  SALVATION;  a  modern  interpr. 

Short  texts  from  Coptic  ostraca  and  papyri 
SHORT  TEXTS/COPTIC  OSTRACA  +  PAPYRI 

Meat  purchasing  a  science 

Suggestions  for  teaching  primary  reading  and  phonics 
SUGGESTIONS/TEACHING  PRIMARY  READING  +  PHONICS 
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Saint  Brigid  of  Ireland 

North  American  ichneumon-flies,  new  and  described,  with  taxonomic 

.  .  .  (suppl.) 
N.  AMER.  ICHNEUMON- FLIES- -NEW  +  DESCRIBED;  /taxonomic  + 


The  Greek  herbal  of  Dioscorides;  illustrated  by  a  Byzantine,  A.D.  512; 

Englished  by  John  Goodyer,  A.D.  1655;  edited  and  first  printed, 

A.D.  1933,  by  Robert  T.  Gunther  (form,  redaction,  internal  ref.  to 

main  entry) 
THE  GREEK  HERBAL;  illus./byzantine,  a.d.  512;  eng./goodyer,  a.d. 

1655--ED.  +  .  .  .--ED.  +  1ST  PR.,  A.D.  1933/GUNTHER 

Annelids  from  the  Danmark  expedition  (scope) 
ANNE  LIDS- -DANMARK  EXPEDITION. 

Old  Kensington  palace  and  other  papers 

OLD  KENSINGTON  PALACE  +  OTHER  PAPERS 

Doderlein's  Hand-book  of  Latin  synonyms  (initialism) 
HAND-BOOK/LATIN  SYNONYMS  (first  word  not  abbreviated) 

Statement  of  the  minister  of  the  Dominican  Republic,  .  .  .,  concerning 
the  incident  of  October  last,  in  one  section  of  the  Haitian- 
Dominican  border  (scope) 

STATEMENT  CONG.  THE  INCIDENT/OCT.  LAST;  in  1  sect./haitian- 
dominican  border 

Historical  sketch  of  the  Congregational  Church  in  Belchertown,  Mass., 
from  its  reorganization,  114  years,  with.  .  .  (scope) 

HISTORICAL  SKETCH/CONGREGATIONAL  CHURCH/ BELCHER- 
TOWN,  MASS. ;  from.  .  . 

Latin  grammar 

Madame  Therese;  or,  The  volunteers  of  '92  (alternative) 
MADAME  THERESE  (THE  VOLUNTEERS/'92) 
THE  VOLUNTEERS/'92 

Jane  Clegg;  a  play  in  three  acts  (form) 
JANE  CLEGG--A  PLAY;  /3  acts 

Nicholas  II,  prisoner  of  the  purple  (explanatory) 
NICHOLAS  2D--PRISONER/PURPLE 

"The  North  Pole",  a  tale  of  the  Arctic  sea;  a  100%  tone  and  speaking 
picture  with  songs,  choruses  and  dances  (explanatory/form/sup- 
plementary) 

THE  NORTH  POLE;  a  tale.  .  . 

The  margin  of  music 
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Two  Tudor  books  of  arms 

TWO  TUDOR  BKS./ARMS  (Numeral  word  not  transliterated  because 
initial) 

An  elementary  geography;  being  also  a  key  to  the  new  series  of  out- 
line maps  (suppl.) 
AN  ELEMENTARY  GEOGRAPHY;  being.  .  . 

Text -book  of  bacteriology 
TEXT-BK./BACTERIOLOGY 

Inventory  of  Memorials,  presented,  by  the  deputies  of  the  Council  of 
Trade  in  France,  to  the  Royal  Council,  in  1701  (initialism,  oc- 
casion, repet.  of  main  ent.) 

MEMORIALS,  PRESENT  ED/ ROYAL  COUNCIL;  /1701 

Paris  following  the  terror  (scope) 
PARIS- -FOLLOWING  THE  TERROR 

Romances  of  the  law 
ROMANCES/ LAW 

Map  of  southern  Idaho  and  the  adjacent  regions  (supplementary) 
MAP/S.  IDAHO;  +  the  adjacent  regions 

Simplified  spelling  from  the  scholarly  standpoint  and  with  reference 
to  the  masses:  the  argument  for  simplified  spelling  (explanatory, 
alternative) 

SIMPLIFIED  SPELLING;  /scholarly  standpoint  +  /masses  (THE 
ARGUMENT/SIMPLIFIED.  .  . 

THE  ARGUMENT/SIMPLIFIED  SPELLING 

A  manual  of  procedure  for  the  clinical  laboratory 
A  MANUAL/PROCEDURE/CLINICAL  LABORATORY 

How  I  became  a  preacher 

The  cementation  of  iron  and  steel 
THE  CEMENTATION/IRON  +  STEEL 

Methods  of  recording  retail  prices  and  measuring  the  cost  of  living 

in  Italy  (scope) 
METHODS/RECORDING  RETAIL  PRICES  +  MEASURING  THE  COST/ 

LIVING/ITALY 

Principles  of  American  forestry 
PRINCIPLES/ AMER.  FORESTRY 

Charter  rights  of  the  Greenville.  .  .  railroad  company  (ref.  to  main 

entry) 
CHARTER  RIGHTS 
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Man-midwifery  exposed  and  corrected;  or,  The  employment.  .  . 

(alternative) 

MAN- MIDWIFERY  EXPOSED  +  CORRECTED  (THE  EMPLOYMENT.  .  . 
THE  EMPLOYMENT.  .  . 

Indians  of  yesterday 
INDIANS/  YE  ST  E  RD  A  Y 

The  ancestress,  tragedy  in  five  acts  (form) 
THE  ANCESTRESS;  tragedy.  .  . 

Children's  toys  of  bygone  days;  a  history  of  playthings  of  all  peoples 

from  prehistoric  times  to  the  XDCth  century  (explanatory/scope) 
CHILDREN'S  TOYS/BYGONE  DAYS;  a  hist./.  .  . 

May  dust 

The  story  of  mankind;  science  of  anthropology- -man's  evolution- -his 
physical,  mental  and  moral  development  (explanatory,  alternative) 
THE  STORY/MANKIND  (SCIENCE/ ANTHROPOLOGY);  man's  .  .  . 
SCIENCE/ANTHROPOLOGY 

The  book  of  electrical  wonders 

THE  BOOK/ELECTRICAL  WONDERS 

The  little  schoolmistress 

The  first  (-second)  book  of  word  and  sentence  work;  or,  Easy  steps 
in  spelling  (inserted  content,  alternative) 

THE  1ST-2ND  BK./WORD  +  SENTENCE  WORK  (EASY  STEPS/SPELL- 
ING) 

EASY  STEPS/SPELLING 

Map  and  aerial  photo  reading  simplified 

MAP  +  AERIAL  PHOTO  READING  SIMPLIFIED 

How  not  to  treat  Illinois  soils 

Case  studies  in  the  development  of  social  attitudes 
CASE  STUDIES/DEV. /SOCIAL  ATTITUDES 

The  absorption  of  resonance  neutrons  by  boron,  chlorine,  cobalt  and 

manganese 
THE  ABSORPTION/RESONANCE  NEUTRONS/ BORON,  CHLORINE, 

COBALT  +  MANGANESE 

Our  debt  to  the  red  man;  the  French -Indians  in  the  development  of  the 

United  States  (explanatory/alternative) 

OUR  DEBT/RED  MAN  (THE  FRENCH-INDIANS/DEV./U.  S.) 
THE  FRENCH-INDIANS/DEV./U.  S. 

A  manual  of  the  timbers  of  the  world;  their  characteristics  and  uses 

(explan. ) 
A  MANUAL/TIMBERS/WORLD;  their.  .  . 
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Assessment  and  taxation  in  Kansas  (scope) 
ASSESSMENT  +  TAXATION --KANSAS 

The  Italian  navy  in  the  world  war  1915-1918  (scope) 
THE  ITALIAN  NAVY- -WORLD  WAR  1915-1918 

Body  form  in  growing  chickens 
BODY  FORM/GROWING  CHICKENS 

The  present  status  of  the  certification  of  teachers  in  the  United 

States  (scope) 
THE  PRESENT  STATUS/ CERTIFICATION/TEACHERS/U.  S. 

The  temperaments;  or,  The  varieties  of  physical  constitution  in  man, 
considered  in  their.  .  .  (alternative/explanatory) 

THE  TEMPERAMENTS  (THE  VARIETIES/PHYSICAL  CONSTITU- 
TION/MAN; cons,  /mental.  .  .) 

THE  VARIETIES/ PHYSICAL  CONSTITUTION/ MAN;  cons. /mental.  .  . 

Report  of  the  Central  board.  .  .  (corporate  main  entry) 

Shakespeare's  heroines;  characteristics  of  women,  moral,  poetical, 

and  historical  (explan.) 
SHAKESPEARE'S  HEROINES;  characteristics.  .  . 

The  memoirs  of  the  Marquise  de  Keroubec  (1785-1858)  being  ex- 
tracts. .  .  (explanatory/ form,  reference  to  main  entry) 
THE  MEMOIRS;  being.  .  . 

The  Peacock  farm 

Technology,  employment,  and  output  per  man  in  petroleum  and 

natural-gas  production  (scope) 
TECHNOLOGY,  EMPLOYMENT,  +  OUTPUT/MAN- -PETROLEUM  + 

NATURAL- GAS  PRODUCTION 

The  way  of  the  West 
THE  WAY/W. 

Poems;  including  The  saint's  tragedy,  Andromeda,  songs,  ballads,  etc. 

(contents) 
POEMS;  including:    THE  SAINT'S  TRAGEDY:    ANDROMEDA;  songs 


THE  SAINT'S  TRAGEDY 
ANDROMEDA 

A  digest  of  the  statutes  of  Arkansas,  embracing.  .  .  (explanatory/ 

scope) 
A  DIGEST/STATUTES/ARK. ;  embracing.  .  . 

The  adventurous  life  of  Count  Lav  alette,  Bonaparte's  aide-de-camp 

and  Napoleon's  postmaster -general  (explanatory) 
THE  ADVENTUROUS  LIFE/COUNT  LAVALETTE-- BONAPARTE'S. 
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Guy  Livingstone;  or,  "Through"  (alternative) 
GUY  LIVINGSTONE  (THROUGH) 
THROUGH 

The  book  that  gave  to  Iowa  its  name,  a  reprint  (formal) 
THE  BOOK  THAT  GAVE/IOWA  ITS  NAME;  a  repr. 

Nutrition 

Driven  to  bay 
DRIVEN/ BAY 

The  perception  of  light  and  color 
THE  PERCEPTION/LIGHT  +  COLOR 

Morphological  studies  of  the  head  and  mouth-parts  of  the  mature 
codling-moth  larva,  Carpocapsa  pomonella  (Linn.)  (explanatory) 

MORPHOLOGICAL  STUDIES/HEAD  +  MOUTH- PTS. /MATURE 
CODLING-MOTH  LARVA;  Carpocapsa.  .  . 

Madame  L'Orient's  art  of  beauty  (initialism) 
ART/BEAUTY 

The  American  vine  dresser's  guide 
THE  AMER.  VINE  DRESSER'S  GUIDE 

A  manual  of  modern  scholastic  philosophy 

A  MANUAL/MODERN  SCHOLASTIC  PHILOSOPHY 

An  oration,  pronounced  July  4,  1799,  at  the  request  of  the  inhabitants 
of  the  town  of  Boston,  in  commemoration  of  the  anniversary  of 
American  independence  (occasion) 

AN  ORATION;  pronounced.  .  .  Boston- -IN  COMMEMORATION.  .  . 

--IN  COMMEMORATION/ ANNIVERSARY/AMER. 
INDEPENDENCE 

The  book  of  prescriptions,  with  notes  on  the  pharmacology.  .  .  (sup- 
plementary) 
THE  BOOK/PRESCRIPTIONS;  /notes.  .  .  . 

Adventures  in  interviewing 
ADVENTURE/INTERVIEWING 

Catalogue  of  Marietta  library  (corporate  main  entry  referred  to) 
CATALOGUE 

The  horn-fly  (Haematobia  serrata  Rob. -Desv.)  (explanatory) 
THE  HORN- FLY;  Haematobia.  .  . 

The  law  of  insolvency  in  South  Africa  (scope) 
THE  LAW/INSOLVENCY--S.  AFRICA 

The  vegetation  of  twin  island 
THE  VEGETATION/TWIN  ISLAND 
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Martin's  natural  history  (initialism) 
NATURAL  HIST. 

Upper  night 

Churches  in  lower  Nubia  (scope) 
CHURCHES- -LOWER  NUBIA 

The  new  State  department 
What  is  wrong? 

Oil  recovery  investigations  of  the  Petroleum  experiment  station  of 

the  U.  S.  Bureau  of  mines 
OIL  RECOVERY  INVESTIGATIONS/PETROLEUM  EXPERIMENT 

STATION/U.  S.  BUREAU/MINES 

The  olive  branch;  or,  The  evil  and  the  remedy  (alternative) 
THE  OLIVE  BRANCH  (THE  EVIL  +  THE  REMEDY) 
THE  EVIL  +  THE  REMEDY 

The  doctrine  of  fascism 
THE  DOCTRINE/ FASCISM 

The  law  of  suffrage  and  elections 
THE  LAW/SUFFRAGE  +  ELECTIONS 

The  archaeological  collections  (corporate  main  entry) 
THE  ARCHAEOLOGICAL  COLLECS. 

Rules  and  practice  of  the  Supreme  Court  of  South  Africa,  Transvaal 

provincial  division,  Witwatersrand  local  division,  and  Appellate 

division  (contents) 
RULES  +  PRACTICE/SUPREME  COURT- -S.  AFRICA:    TRANSVAAL 

PROVINCIAL  DIVISION:    WITWATERSRAND  LOCAL  DIVISION: 

APPELLATE  DIVISION 

TRANSVAAL  PROVINCIAL  DIVISION   (a  more  desirable  result  would 
WITWATERSRAND  LOCAL  DIVISION   be  to  get  each  of  these  re- 
APPELLATE  DIVISION  tracings  as  a  continuation  of 

the  opening  phrase) 

Suggested  forms  for  internal  financial  reports  of  colleges  and  uni- 
versities (scope) 

SUGGESTED  FORMS/ INTERNAL  FINANCIAL  RE  PORTS/ COLLS.  + 
UNIVS. 

The  war  and  South  American  trade,  issued.  .  .  by.  .  .  (occasion,  refer- 
ence to  m.  e.) 
THE  WAR  +  S.  AMER.  TRADE;  iss.  .  .  . 

Report  of  the  secretary  of  the.  .  .  on  the  census.  .  .  1861  (ref.  to  corp. 

main  ent.) 
REPORT/CENSUS 
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Relation  of.  .  .,  tr.  from  the  Spanish  by  Buckingham  Smith  (reference 

to  main  entry,  redaction) 
RE  LATION-  -TR./SPANISH/SMITH 

Oaths  and  obligations  of  free  masonry,  as  published  to  the  world,  by 
a  convention  of  seceding  masons,  held.  .  .  (title  main  entry,  ex- 
planatory/occasion) 

OATHS  +  OBLIGATIONS/FREE  MASONRY;  as  publ. /world.  .  . 

A  Gael's  guide  to  Waterford  and  the  Deise  country 

A  GAEL'S  GUIDE/WATERFORD  +  THE  DEISE  COUNTRY 

The  church  of  Rome:    a  view  of  the  peculiar  doctrine,  religious  wor- 
ship, ecclesiastical  polity,  and  ceremonial  observances,  of  the 
Roman  Catholic  Church  (expl. ) 

THE  CHURCH/ROME;  a  view/ (THE  PECULIAR  DOCTRINE,  RE- 
LIGIOUS WORSHIP,  ECCLESIAST.  .  . 

THE  PECULIAR  DOCTRINE,  RELIGIOUS  WORSHIP,  ECCLESIASTI- 
CAL. .  . 

Poems.    Dedicated  to  Thomas  Moore,  esq.  (form) 
POEMS- -DEDICATED/THOMAS  MOORE,  ESQ. 

The  man  with  the  lantern 
THE  MAN/LANTERN 

Love  in  its  extasie;  or,  The  large  prerogative  (alternative) 
LOVE/EXT ASIE  (THE  LARGE  PREROGATIVE) 
THE  LARGE  PREROGATIVE 

Export  opportunities  in  South  and  Central  America  (scope) 
EXPORT  OPPORTUNITY  S/S.  +  CENTRAL  AMER. 

An  answer  to  the  remarks  of  the  Plymouth  company,  or  (as  they  call 
themselves)  the  Proprietors  of  the  Kennebeck  purchase.  .  .  on  the 
plan  and  extracts  of  deeds 

AN  ANSWER/REMARKS/PLYMOUTH  COMPANY;  or.  .  .  (LC  has  a 
tracing  "Plymouth  Company,  1749-1816.    Remarks  on  the  plan  and 
extracts  of  deeds"  which  probably  cannot  be  automatically  ex- 
tracted) 

A  handbook  to  the  palace  of  Minos  at  Knossos,  with  its  dependencies 

(scope,  suppl.) 
A  HANDBOOK/PALACE/MINOS/KNOSSOS;  /its.  .  . 

Nutrition  and  the  school  lunch 
NUTRITION  +  THE  SCHOOL  LUNCH 

Some  decisions  on  the  Property  acts 
SOME  DECISIONS/PROPERTY  ACTS 
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Taxation  in  New  York,  1924;  the  complete  New  York  tax  law.  .  . 

(scope,  explanatory) 
TAXATION--N.  Y.,  1924;  the.  .  . 

Essays  on  the  political  history  of  the  fifteenth,  sixteenth,  and  seven- 
teenth centuries 
ESSAYS/POL.  HIST./15TH,  16TH,  +  17TH  CENTURIES 

Church  music  in  the  light  of  the  motu  proprio;  a  guide  for  the  Catholic 

choir-master  and  organist  (explanatory) 
CHURCH  MUSIC/LIGHT/MOTU  PROPRIO;  a  guide.  .  . 

D.  Francisco  Manuel  de  Mello 

The  dollar  crop,  and  its  effect  upon  labor,  agriculture,  commerce 

and  manufacture 
THE  DOLLAR  CROP,  +  ITS  EFFECT/ LABOR,  AGRICULTURE,  .  .  . 

Reynold's  code.    Polyglot  nautical  telegraph  for  the  use.  .  .  (initial- 
ism,  explan.) 

CODE  (POLYGLOT  NAUTICAL  TELEGRAPH;  /use.  .  . 
POLYGLOT  NAUTICAL  TELEGRAPH;  /use.  .  . 

Acts  relating  to  the  public  schools  of  Rhode  Island,  with  remarks  + 

forms,    (scope,  supplementary) 
ACTS/PUBLIC  SCHOOLS/R.  I. ;  with.  .  . 

Rustic  architecture 

Fair  exchange;  a  novel  in  the  first  person  (form) 
FAIR  EXCHANGE;  a  novel.  .  . 

Memoirs  of  Duke  de  Richelieu  (reference  to  main  entry) 
MEMOIRS 

Racing  bits;  a  story  of  the  oil-fields  of  Texas  (form/explanatory) 
RACING  BITS- -A  STORY/OIL-FIELDS/TEX. 

Floats  for  the  so-called  fattening  of  oysters 
FLOATS/SO-CALLED  FATTENING/OYSTERS 

An  address  to  the  flocks  of  the  reverend  approvers  of  Blanco  White's 

internal  evidences  against  Catholicism 
AN  ADDRESS/FLOCKS/REVEREND  APPROVERS/BLANCO  WHITE'S: 

INTERNAL  EVIDENCES  AGAINST  CATHOLICISM 
INTERNAL  EVIDENCES  AGAINST  CATHOLICISM 

Obstetrical  notes  for  juniors  (occasion) 
OBSTETRICAL  NOTES/ JUNIORS 

Canary  breeding  for  beginners;  a  practical  up-to-date  guide  (oc- 
casion/form) 

CANARY  BREEDING/BEGINNERS;  A  PRACTICAL  UP-TO-DATE 
GUIDE 
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St.  Philip's  church.    Memorial  poem  (title  main  entry,  form) 
ST.  PHILIP'S  CHURCH.    MEMORIAL  POEM  (retention  of  original 
punctuation  leaves  whole  as  single  field) 

Annual  report  of  the  town  (corporate  main  entry) 
ANNUAL  REPORT/TOWN 

The  wayfarer;  a  novel  (form) 
THE  WAYFARER;  A  NOVEL 

A  bibliographical  manual  for  the  student  of  criminology  (occasion) 
A  BIBLIOGRAPHICAL  MAN. --/STUPE NT/ CRIMINOLOGY  (first  word 
not  abbrev.) 

Syllabus  of  Davis'  International  law 
SYLLABUS/DAVIS':    INTERNATIONAL  LAW 
INTERNATIONAL  LAW 

On  the  wing  through  Europe 

The  second  Sexton  cook  book  for  the  Sexton  market  (repetition  of 
main  entry,  but  not  given  as  an  initialism;  best  to  arbitrarily 
abridge  as:) 

THE  2D  COOK  BK. /SEXTON  MARKET 

All's  well  that  ends  well,  by.  .  .;  ed.,  with  notes,  introduction,  glos- 
sary, list  of  various  readings,  and  selected  criticism,  by.  .  . 
(redaction/form) 

ALL'S  WELL  THAT  ENDS  WELL--ED.;/NOTES,  INTRO.,  GLOS- 
SARY, LIST/VARIOUS  READINGS,  +  SEL.  CRITICISM;  BY.  .  . 
(traced  as:) 

ALL'S  WELL  THAT  ENDS  WELL--ED.  BY.  .  . 

Thanksgiving  plays  for  boys  and  girls,  a  collection  of  six.  .  .  (form, 

occasion) 
THANKSGIVING  PLAYS/GIRLS  +  BOYS;  A  COLLEC./6.  .  . 

Final  report  of  the.  .  .  (corporate  main  entry  repeated) 
FINAL  REPORT 

The  young  citizens  league;  helps  and  suggestions  for  organizing 

(form) 
THE  YOUNG  CITIZENS  LEAGUE;  HELPS.  .  . 

Not  a  sparrow  falls,  a  novel  (form) 
NOT  A  SPARROW  FALLS;  A  NOVEL 

Manifesto  of  the  Spanish  nation  to  Europe 
MANIFESTO/SPANISH  NATION/EUROPE 
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O  C  D,  outline  of  lectures  on  civilian  defense  (alternative  to  abbrevi- 
ation) 

O  C  D,  OUTLINE/LECTURES/CIVILIAN  DEFENSE  (Note  need  for 
spaces  in  abbrev.  entries) 

Sketching  methods 

The  Swiss  cross,  a  monthly  magazine  of  popular  science  (title  main 

entry,  form/explanatory) 
THE  SWISS  CROSS;  A  MONTHLY.  .  . 

Shelley 

The  Germania  of  Tacitus,  with  ethnological  dissertations  and  notes. 

By  R.  G.  Latham  (supplementary/redaction) 
THE  GERMANIA;/ETHNOLOGICAL  DISSERTATIONS  +  NOTES--BY 

R.  G.  LATHAM 
(traced  as:) 
THE  GERMANIA- -BY  R.  G.  LATHAM 

Carolina  humor;  sketches  by.  .  .  (form) 
CAROLINA  HUMOR;  SKETCHES.  .  . 

Brief  memorials  of  an  only  daughter 
BRIEF  MEMORIALS/ONLY  DAUGHTER 

The  distant  princess 

Tumminello  theory  of  numerical  roots  (initialism) 
THEORY/NUMERICAL  ROOTS 

The  minority  of  Henry  HL 
THE  MINORITY/HENRY  3D 

The  twin  sisters;  or,  The  effects  of  education:    a  novel;  in  a  series  of 

letters  (title  main  entry,  alternative,  form) 
THE  TWIN  SISTERS  (THE  EFFECTS/EDUC);  A  NOVEL.  .  . 
THE  EFFECTS/EDUC.;  A  NOVEL.  .  . 

A  history  of  philosophy,  from  Thales  to  the  present  time  (scope) 
A  HISTORY/PHIL.  ;/THALES.  .  . 

Under  the  crown,  American  history;  selections  from  the  Youth's  com- 
panion (title  main  entry,  explanatory,  form) 
UNDER  THE  CROWN- -AMER.  HIST.;  SEL./.  .  . 

The  Unitarian;  conducted  by.  .  .  (title  main  entry) 
THE  UNITARIAN;  .  .  . 

The  application  of  statistical  methods  to  the  problems  of  psycho- 
physics 

THE  APPLICATION/STATISTICAL  METHODS/ PROBLEMS/ 
PSYCHOPHYSICS 
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The  geology  of  the  country  around  Newton  Abbey  (scope) 
THE  GEOLOGY/COUNTRY  AROUND  NEWTON  ABBEY 

Studies  in  the  Asclepiadaceae 
ST  UDIE  S/ ASC  LE  PIADACE  AE 

An  essay  on  the  antiquity  of  the  Irish  language,  with  a  preface,  proving 

Ireland  to  be  the  Thule  of  the  ancients  (supplementary) 
AN  ESSAY/ ANTIQ. /IRISH  LANGUAGE;  WITH.  .  . 

Hand  book  of  the  United  States  tariff,  containing  the  tariff  act  of  1897 
rev.  to  July  1,  1902,  with  complete  schedules  of.  .  .  (explanatory) 
HAND  BK./  U.  S.  TARIFF;  cont.  .  . 

Deterioration  of  steels  in  the  synthesis  of  ammonia 
DETERIORATION/STEELS/SYNTHESIS/AMMONIA 

An  integrated  course  of  study  in  speech 

AN  INTEGRATED  COURSE/STUDY/SPEECH 

The  story  of  milk  and  how  it  came  about 
THE  STORY/MILK  +  HOW  IT  CAME  ABOUT 

Paintings  of  the  sea 
PAINTINGS/SEA 

An  introduction  to  English  grammar,  on  an  analytical  plan,  adapted 
to  the  use  of  students  in  colleges.  .  .  (form/occasion) 

AN  INTRODUCTION/ENG.  GRAMMAR,/ANALYTICAL  PLAN; 
ADAPTED.  .  . 

Laird  &  Lee's  vest-pocket  Webster  pronouncing  dictionary.  .  . 

(initialism  for  publ.) 
LAIRD  +  LEE'S  VEST-POCKET  WEBSTER  PRONOUNCING  PICT. 

Anticlinal  structure  in  parts  of  Cotton  and  Jefferson  counties,  Okla- 
homa (scope) 

ANTICLINAL  STRUCT URE/PTS./ COTTON  +  JEFFERSON  COUN- 
TIES, QKLA. 

The  analytical  chemist's  assistant:    a  manual  of  chemical  analysis, 
both  qualitative  and  quantitative  of  natural.  .  .  (explanatory) 

THE  ANALYTICAL  CHEMIST'S  ASSISTANT:    A  MANUAL.  .  . 

A  MANUAL/CHEMICAL  ANALYSIS,  BOTH  QUALITATIVE  +  QUANTI- 
TATIVE. .  . 

The  Greek  romances  in  Elizabethan  prose  fiction 

THE  GREEK  ROMANCES/ELIZABETHAN  PROSE  FICTION 

Memorials  of  James  Wood,  LL.D.,  J.P.  of  Grove  House,  Southport 
MEMORIALS/JAMES  WOOD;  LL.D.  .  . 
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The  silence,  thoughts  on  the  silence,  and  on  various  other  subjects 

(explanatory) 
THE  SILENCE;  THOUGHTS.  .  . 

Punches,  dies  and  tools  for  manufacturing  in  presses  (scope) 
PUNCHES,  DIES  +  TOOLS- -/MANUFACTURING  IN  PRESSES 

Social  life  at  the  English  universities  in  the  eighteenth  century  (scope) 
SOCIAL  LIFE/ENG.  UNIV./18TH  CENTURY 

The  filing  of  such  a  sample  of  entries  has  not  yet  been  under- 
taken, though  its  satisfactory  use  is  obviously  necessary  before  such 
a  compression  can  be  judged  acceptable.    However,  the  principles 
and  desiderata  enunciated  in  the  preceding  text,  together  with  the 
Hines/Harris  code,  should  produce  the  basis  on  which  to  attain  such 
an  acceptability.    The  two  major  problems,  of  course,  are  the  fil- 
ing of  abbreviations  with  other  similar-stemmed  words  (a  problem 
encountered  in  the  filing  of  L.C.  subject  headings,  with  their  com- 
pulsory abbreviations,  already),  and  of  the  filing  of  the  virgule 
(=prepositional-phrase- introducers)— this  last  particularly  in  that 
the  same  symbol  equivalated  (in  the  recommendations)  to  a  multipli- 
city of  such  prepositions,  from  a  ("at".  .  .)  almost  to  :z  ("with".  .  .). 


INFORMATION  RETRIEVAL  PROJECTS  IN  THE  BIO- 
MEDICAL  LIBRARY,  UNIVERSITY  OF  CALIFORNIA,  LOS  ANGELES 


Louise  Darling 


In  the  area  of  information  retrieval,  the  Biomedical  Library  is 
currently  engaged  in  two  projects:    the  UCLA  MEDLARS  Station  and 
the  Brain  Information  Service.    This  paper  is  limited  to  a  condensed 
and  non-technical  consideration  of  their  machine  programs.    The  two 
projects  are  interrelated  through  the  National  Library  of  Medicine's 
Medical  Literature  Analysis  and  Retrieval  System  (MEDLARS)  which 
must  therefore  be  our  starting  point.    At  the  outset  it  should  be  made 
clear  that  this  is  a  report  on  work  in  which  the  author's  role  is  pure- 
ly administrative.    The  progress  and  accomplishments  described  are 
due  to  the  efforts  of  the  staff  of  the  projects. 


UCLA  MEDLARS  Station 


MEDLARS  was  developed  at  the  National  Library  of  Medicine 
(NLM)  during  the  years  1961-1963.    It  operated  on  a  provisional  basis 
in  1963  and  in  1964  became  the  sole  system  for  producing  the  Index 
Medicus,  other  recurrent  bibliographies  in  special  subject  areas  and 
individual-oriented  demand  searches.    The  file  consists  of  citations 
in  sequential  order  on  magnetic  tape  with  each  citation  carrying  its 
own  set  of  descriptors.    For  each  record,  the  journal  title  and  de- 
scriptors are  in  code,  all  else  is  in  English.    Searches  are  batched, 
but  the  entire  file  (or  a  specified  portion  by  date)  is  searched  for  each 
request;  i.e.,  as  presently  designed,  the  file  is  not  open  to  random 
access.    Around  40,000  citations  were  put  into  the  system  in  1963, 
but  generally  searches  are  begun  with  the  1964  store  since  the  litera- 
ture representation  for  1963  is  so  incomplete.    Counting  from  1964, 
the  file  now  contains  over  350,000  citations  and  grows  currently  at  a 
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rate  of  approximately  14,000  citations  per  month.    The  descriptors 
comprise  the  Medical  Subject  Heading  List,  usually  referred  to  under 
its  acronym  as  MeSH.    MeSH  is  a  rigidly  controlled  vocabulary  with 
hierarchical  features.    It  and  other  components  of  the  system  will  be 
considered  later  in  relation  to  the  UCLA  Search  Station.    MEDLARS 
programs  are  written  in  machine  language  (Argus)  for  the  Honeywell 
800  Computer.    The  flow  chart  in  Figure  1  gives  a  very  general  idea 
of  the  system  as  it  operates  at  NLM.  1   The  system  is  under  continu- 
ous study  and  subject  to  frequent  revision. 

Recognizing  that  enormous  demands  would  be  placed  on  the 
system  if  it  lived  up  to  the  intent  of  its  design,  NLM  early  developed 
the  concept  of  reproducing  its  data  tapes  for  use  in  installations 
across  the  country,  thereby  expanding  manyfold  access  to  a,  if  not 
the  principal  part  of  the  world's  biomedical  literature.    Large  scale 
decentralization  presented  two  major  problems.    First,  the  programs 
required  rewriting  in  a  language  which  would  make  for  easy  adaptation 
to  other  computers  comparable  to  the  Honeywell  800;  second,  a  me- 
chanical conversion  of  data  from  the  3/4  inch  tapes  used  by  Honey- 
well to  the  1/2  inch  tape  used  by  IBM  and  other  equipment  must  be 
made.    Fortunately,  the  means  to  do  the  latter  at  relatively  low  cost 
had  already  been  developed.  2 

Plans  for  decentralization  went  forward.    In  November  1964, 
NLM  awarded  a  contract  to  the  University  of  California  at  Los  Ange- 
les to  rewrite  the  programs  in  the  COBOL  language  and,  following 
this,  to  establish  a  pilot  demonstration  search  center  which  would 
test  the  programs  under  actual  operating  conditions.    Only  programs 
for  the  retrieval  subsystem  were  specified,  of  course,  since  neither 
input  nor  publications  subsystems  are  a  part  of  the  decentralization 
plan.    The  renewal  contract  for  March  1966-February  1967  specifies 
the  Pacific  Time  Zone,  Alaska,  and  Hawaii  as  the  area  the  Station  is 
to  serve.    A  second  contract  was  awarded  in  January  1965  to  the 
University  of  Colorado  Medical  Center  to  operate  a  pilot  search 
center  using  compatible  equipment.    Under  the  direction  of  Dr. 
Frank  B.  Rogers,  this  station  successfully  served  UCMC  from  Febru- 
ary to  August;  then  under  terms  of  a  new  contract,  in  September  1965, 
opened  to  the  medical  community  in  the  Mountain  Time  Zone.    Much 
experience  valuable  to  the  whole  decentralization  program  has  been 
gained  from  the  pioneering  work  at  Denver.  3    Three  additional  sta- 
tions have  recently  been  authorized  to  serve  the  Northeast,  the  South, 
and  the  Midwest.    (See  Figure  2.)   These  will  use  the  UCLA  COBOL 
programs  modified  as  needed. 

Rewriting  the  MEDLARS  programs  at  UCLA  has  been  the  re- 
sponsibility of  the  Health  Sciences  Computing  Facility  (HSCF)  (which 
is  sponsored  by  the  Division  of  Facilities  and  Resources  of  the  Na- 
tional Institutes  of  Health  through  grant  FR-3.)  The  Facility  is  a  very 
busy  center  operating  twenty-four  hours  a  day,  seven  days  a  week.    It 
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deals  primarily  with  computations  and  data  processing  for  scientific 
research  conducted  in  the  Center  for  the  Health  Sciences.    Several 
off -campus  users  are  served  through  remote  inquiry  stations.    A 
staff  of  programmers  is  available  for  advice  and  assistance,  but  large 
projects  using  the  facility  usually  employ  additional  programmers. 
The  principal  equipment  complex  is  now  an  IBM  7040-7094  direct 
couple  configuration.    However  the  IBM  7040-7094  complex  is 
scheduled  to  be  replaced  well  before  the  end  of  the  calendar  year  by 
an  IBM  360  model  75  system.    Implementation  of  the  new  system  is 
now  under  way  and  users  are  preparing  for  necessary  computer  pro- 
gram modifications.    (The  IBM  7094  is  a  stand  alone  configuration 
but  is  almost  always  operated  in  a  direct  couple  mode.)    Its  runs  fall 
into  four  categories.    In  descending  priority  these  are:    Category  1, 
jobs  of  eight  minutes  or  less;  Category  2,  fifteen  minutes  or  less; 
Category  3,  thirty  minutes  or  less;  and  Category  4,  over  thirty 
minutes.    The  latter  generally  are  run  at  night  and  on  weekends.    (Ex- 
ceptions are  made,  of  course,  and  the  pattern  itself  adjusted  as  neces- 
sary to  the  overall  workload  in  the  Facility.)    The  Facility  is  thus 
geared  for  maximum  computation  time  efficiency  in  line  with  needs 
of  the  great  majority  of  its  users.    Those  who  do  not  fall  into  this 
group  must  adapt  to  it. 

Because  programmer  recruiting  is  always  a  problem  in  Southern 
California,  especially  for  fixed  term  projects,  HSCF  decided  to  sub- 
contract for  the  MEDLARS  conversion  and  at  the  same  time  to  train 
a  resident  MEDLARS  programmer  who  would  carry  on  as  part  of  the 
Search  Station  after  the  initial  job  was  completed.    The  subcontract 
was  awarded  to  the  Planning  Research  Corporation  (PRC)  and  has 
proved  a  satisfactory  arrangement.    Liaison  with  NLM  has  been  ef- 
fected through  its  Information  Systems  Division  and  via  the  rather 
extensive  training  program  in  MEDLARS  indexing  and  searching  tech- 
niques given  four  members  of  our  staff  who  spent  from  three  to  five 
months  each  at  NLM  during  1965.    The  vital  need  for  constant  inter- 
change between  the  parent  system  and  satellite  centers  cannot  be 
overemphasized,  especially  since  file  maintenance  and  updating  are 
handled  centrally.  2 

The  NLM-UCLA  contract  stipulated  that  the  system  retain  its 
ability  to  accept  all  forms  of  search  request  statements  and  to  pro- 
duce all  standard  report  forms.    However,  because  of  the  wide  differ- 
ences between  the  Honeywell  800  and  the  IBM  7094  in  both  hardware 
and  software,  the  UCLA  system  was  not  restricted  to  specific  guide- 
lines of  the  NLM  system,  although  the  system  logic  in  general  was 
to  be  retained.    The  COBOL  requirement  has,  in  most  respects, 
worked  out  satisfactorily,  for  not  only  is  the  language  adapted  to 
handling  large  quantities  of  MEDLARS  type  data,  but  it  is  a  fast  lan- 
guage for  writing  programs  for  systems  of  the  MEDLARS  type.    Dur- 
ing the  implementation  phase,  no  COBOL  compiler  nor  COBOL 
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systems  errors  were  found,  although  there  were  difficulties  due  to 
lack  of  IBM  documentation.    Use  of  COBOL  compared  with  machine 
language  has  not  lengthened  searching  time  quite  as  much  as  some 
predicted,  but  neither  is  it  proving  as  efficient  as  appeared  to  be 
indicated  six  months  ago.    The  section  most  costly  in  machine  time 
is  the  high  speed  or  gross  search  submodule  (HSS)  which  has  been 
averaging  two  to  two  and  one  half  hours  for  the  1964-1965  file.    Since 
the  HSS  is  based  on  retrieval  of  coded  descriptors,  not  on  English 
words,  it  would  be  feasible  to  redo  the  routines  for  it  in  machine 
language.    Some  installations  might  also  wish  to  consider  redoing 
some  of  the  Log  Search  routines. 4    (The  search  routine  programs 
were  rewritten  in  MAP  during  the  summer  of  1966  at  UCLA.) 

Problems  in  implementing  the  UCLA  system  have  not  been  as 
numerous  as  might  have  been  expected  in  view  of  the  wide  divergence 
between  the  UCLA  and  NLM  equipment.    There  were  however,  two 
serious  tape  file  problems:    tape  bad  spot  handling  and  character 
writing.    The  Honeywell  800  does  not  bypass  bad  spots  while  writing, 
but  the  IBM  7094  does  bypass  these  spots  in  reading,  thus  losing  data. 
The  second  problem  arose  in  the  tape  conversion  process.    The 
Honeywell  800  writes  only  in  multiples  of  eight  characters  (one  com- 
puter word  consisting  of  48  bits  in  8  bit  units)  while  the  IBM  7040, 
which  at  UCLA  handles  the  tape  for  the  IBM  7094,  works  in  multiples 
of  six  characters  (36  bits  to  the  word).  4   NLM,  which  handles  the 
first  step  in  the  conversion  in-house  through  a  satellite  of  the  Honey- 
well 800,  is  modifying  the  routines  for  this  step;  meanwhile  UCLA 
uses  the  IBM  7094  which  can  handle  eight  character  words  instead  of 
the  IBM  7040  to  complete  the  conversion.    Other  difficulties  with  tape 
reading  continued  for  some  time.    Among  the  trouble  causers  was 
the  closeness  of  the  load  point  to  the  first  record  and  the  use  of  high 
density  tape  (800  bits  per  inch),  apparently  corrected  by  return  to 
556  density.*   Another  stubborn  tape  read  problem,  although  not  one 
connected  with  the  conversion,  appeared  to  be  tape  stretching  on  the 
rewind  after  High  Speed  Search,  a  condition  which  necessitated  sever- 
al readjustments  of  the  tape  drives. 

A  word  needs  to  be  added  on  transference.    The  UCLA  pro- 
grams can  be  fairly  easily  modified  for  computers  of  like  size  and 
configuration,  providing  a  COBOL  compiler  is  also  available,  but,  to 
quote  Frank  Garvis,  project  manager  for  PRC,  "The  use  of  the  COBOL 
language  does  not  mean  MEDLARS  is  now  easily  transferable,  with 
only  minor  changes,  on  any  computer  configuration.    To  put  the  UCLA 
MEDLARS  system  on  smaller  computers,  such  as  IBM  1401  and  1410, 
would  require  major  and  costly  revisions  to  bring  it  within  the  con- 
fines of  the  normal  small  machine  hardware  and  software  configu- 
ration."4 


*The  rapid  growth  of  the  file  caused  us  to  try  high  density  tape 
again  in  the  summer  of  1966,  this  time  with  success. 
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There  have  been  a  number  of  changes  in  the  NLM  system  since 
the  UCLA  contract  was  awarded.    The  two  of  most  immediate  im- 
portance for  station  operation  are  the  addition  of  subheadings  to  the 
MeSH  as  of  January  1966,  and  program  preparation  for  the  inclusion 
of  citations  for  monographs.    The  latter  require  a  format  different 
from  that  for  journal  citations.    The  routines  for  these  additions  and 
a  program  to  annotate  the  master  citation  file  to  indicate  UCLA  cur- 
rent serial  holdings,  as  is  done  in  the  Colorado  Search  Station,  will  be 
completed  within  a  few  weeks  as  an  extension  of  the  subcontract  with 
PRC.    There  are  several  more  additions  to  the  UCLA  system  which 
the  resident  programmer  has  under  way  or  will  begin  soon.    Among 
these  are  the  option  for  restricting  the  HSS  to  a  selected  portion  of 
the  master  citation  file,  beginning  with  the  most  recent  citations,  a 
check  for  possible  loss  of  data  on  the  converted  tapes,  implementation 
of  the  option  for  listing  citations  under  any  desired  headings  used  for 
the  search,  allowance  for  printing  out  any  subset  of  a  search  or  any 
combination  of  subsets,  choice  of  specifying  tracings  for  one  subset 
and  not  for  another,  and  increase  in  the  number  of  terms  which  can 
be  specified  per  search.    (Main  heading  terms  or  descriptors  are  now 
limited  to  100,  for  example.) 

With  the  transfer  to  the  IBM  360  system,  computer  time  shar- 
ing and  on-line  display  devices  to  assist  in  both  file  maintenance  and 
searching  techniques  should  be  considered.    It  has  also  been  suggest- 
ed that  the  master  citation  file  might  be  inverted  so  that  each  de- 
scriptor carries  a  list  of  its  citations  instead  of  the  reverse  arrange- 
ment which  now  prevails.    Random  access  would  reduce  computer 
time,  but  such  a  change  would  also  enlarge  the  file  and  necessitate 
rather  complete  reprogramming.  4   NLM  is  at  present  planning  a 
study  of  redesign  of  the  system.    Obviously,  there  is  an  almost  un- 
limited horizon  for  study  and  for  change. 

Now  let  us  turn  to  a  brief  scrutiny  of  the  operation  pattern  we 
are  attempting  to  follow  at  UCLA.    It  corresponds  in  general  with 
that  of  NLM.    Figures  2-12  trace  the  journey  of  a  search  request. 
The  requests  on  the  Station's  form  (Figure  3)  are  sent  by  mail  or 
brought  in  person  to  the  Search  Station  where  they  are  assigned  for 
formulation  to  one  of  the  search  specialists.    The  most  important 
part  of  the  formulation  is  the  selection  of  descriptors  or  main  sub- 
ject headings  from  the  printed  MeSH  and  from  the  provisional  head- 
ings sent  us  by  NLM.    For  use  in  the  computer  search,  MeSH  is  put 
in  alphabetical  order  on  tape  and  in  this  form  known  as  the  Medical 
Dictionary  Tape  (MDT).    This  tape  also  contains  a  tally  of  the  number 
of  times  each  heading  has  been  used.    NLM  sends  the  Station  an  up- 
dated MDT  every  month.    There  is  not  time  to  discuss  MeSH  in  any 
detail,  but  it  should  be  remembered  that  descriptors  are  given  in 
both  alphabetical  and  category  position.    Some  categories  have  been 
exploded  for  machine  searching  to  form  tree  structures.    A  tree 
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structured  category  is  one  where  terms  have  been  hierarchically 
ordered  from  generic  to  specific,  using  four  levels  of  classification. 
The  tree  can  be  entered  on  any  of  the  four  levels  and  searched  for 
this  level  and  one  or  more  levels  under  it.    The  system  allows  for 
127  tree  structures,  many  of  which  are  in  the  process  of  develop- 
ment. 5    Citations  are  given  an  average  of  eight  to  ten  descriptors 
compared  with  an  average  of  two  or  three  in  the  printed  Index  Medi- 
cus. 

Searching  is  done  through  the  logical  operators  and  (*)  or  (+), 
and  not  (-).    An  important  feature  of  the  system  is  the  provision  for 
as  many  as  three  search  expressions  or  subsearches,  A  being  the 
most  general,  B  less  general,  and  C  the  most  specific.    This  makes 
it  possible  to  test  retrieval  hypotheses,  i.e.,  to  use  both  broad  and 
narrow  approaches  simultaneously.    The  relationship  between  the 
subsearches  is  used  in  selecting  the  most  significant  elements  for  the 
gross  or  high  speed  search,  for  what  satisfies  the  A  search  obviously 
should  cover  B  and  C  which  are  refinements  of  A.  5   All  searches  are 
not  however,  required  to  have  three,  or  even  two  expressions.    "A 
significant  element  is  a  term  or  search  element  with  the  lowest  index 
usage  in  a  search  and  therefore  of  highest  retrieval  value.    For  ex- 
ample, in  a  search  on  thalidomide  poisoning  in  rabbits,  the  significant 
element  is  thalidomide.    In  a  search  with  Or'd  expressions  there  may 
be  as  many  significant  elements  as  there  are  combinations.    Thus  in 
a  search  on  new  uses  of  dimethyl  sulfoxide  or  procaine,  both  drugs 
are  significant  elements.    On  the  other  hand,  for  recent  work  on  kid- 
ney transplantation  in  dogs  or  monkeys  there  is  just  the  one  signifi- 
cant element,  kidney  transplantation. "  2 

Figure  4  shows  the  side  of  the  formulation  sheet  on  which  all 
the  elements  for  the  search  have  been  listed,  including  unions  of 
terms.    This  is  the  Request  Element  Definition  List  or  RED  List. 
Figure  5  shows  the  other  side  of  the  formulation  record  where  the 
essential  information  about  the  requestor  and  the  request  are  given 
in  the  upper  section  and  the  search  equation  in  the  lower.    IBM  col- 
umns are  indicated  and  type  of  IBM  card  to  be  punched,  e.g.,  header 
cards  (02),  RED  List  cards  (03),  elements  and  operators  for  sub- 
searches  (04-06),  and  report  generator  cards  (07). 

A  decklet  of  punched  cards  for  the  search  is  prepared  from  the 
formulation  sheet,  the  decklets  batched,  then  sent  through  prelimi- 
nary Edit  (see  Figure  6).    This  is  done  as  an  additional  safeguard  to 
proof  subject  heading  cards  and  to  obtain  a  tally  of  the  number  of 
citations  which  the  significant  elements  can  be  expected  to  retrieve. 
Tallies  can  also  be  taken  manually  from  the  print-out  of  the  MDT  and 
added,  but  it  is  more  convenient  to  obtain  them  through  the  Edit  Pro- 
gram which  only  takes  3  to  4  minutes  to  run  and  may  save  hours  of 
time  by  detecting  errors  such  as  off-center  keypunching  and  mis- 
spelling in  advance  of  the  main  run.    The  tally  is  essential  for 
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estimating  the  time  it  will  take  to  run  the  batch.    This  information  is 
required  for  all  jobs  submitted  to  the  Computing  Facility.    After  pre- 
liminary Edit  some  components  of  the  batch  may  be  removed  for 
separate  running  if  the  tally  has  shown  a  very  high  retrieval  count. 
The  batch  or  batches  then  go  to  HSCF  with  the  proper  control  cards 
for  the  main  run  (see  Figure  7).    Note  the  Intermediate  Retrieved 
Citation  File  tape.    If  there  is  insufficient  time  for  the  entire  search, 
or  if  the  search  blows  up  in  Logical  Search,  the  citations  retrieved 
from  HSS  may  be  stored  off-line  on  tape  and  run  again  later  through 
Logical  Search.    In  Logical  Search  the  citation  output  from  HSS  is 
searched  against  the  search  formulations  in  such  a  way  that  only  the 
citations  which  logically  satisfy  the  requests  are  selected.    An  inter- 
mediate tape  with  the  results  of  Log  Search  may  also  be  saved  for 
later  processing  through  Report  Generator  if  there  is  need  to  do  so. 
Processing  of  the  batch  may  also  be  stopped  at  this  point  to  obtain  a 
statistical  report  on  number  of  citations  retrieved  per  search  so  that 
searchers  may  check  on  their  formulations  and  rewrite  them  if  the 
count  indicates  trouble.    Figures  8-10  illustrate  formats  in  which  the 
completed  search  bibliographies  may  be  obtained;  figures  11-12  make 
up  the  request  and  form  for  user  appraisal  of  the  search. 

Our  experience  thus  far  has  shown  batching  to  be  of  critical  im- 
portance.   The  MEDLARS  file  is  a  large  one,  growing  rapidly  larger. 
When  it  is  searched  with  the  COBOL  programs,  machine  time  used 
increases  markedly.    Thus  it  is  imperative  to  make  optimum  use  of 
available  time.    The  maximum  batch  allowed  by  the  UCLA  system 
appears  to  be  more  than  adequate— something  in  the  realm  of  100 
searches  with  an  average  of  50  machine  words  each,  the  limit  being 
set  by  the  request  table  for  the  coded  search  expression.    This  table 
provides  locations  for  a  total  of  5,250  machine  words.    The  present 
limit  can  be  raised,  but  it  is  important  to  keep  in  mind  that  as  batch 
size  increases,  the  difference  between  the  number  of  citations 
searched  in  the  HSS  and  the  number  searched  in  Log  Search  lessens, 
thereby  cutting  the  advantage  gained  from  the  gross  search  feature  of 
the  system.  2   The  possibility  of  tape  runover  in  HSS  with  large  batch- 
es, which  has  been  a  problem  in  the  Colorado  station,  causes  no  con- 
cern in  the  UCLA  station  because  disk  is  used  in  place  of  tape. 
(There  was  a  problem  when  intermediate  tape  files  were  stored  off- 
line, but  this  has  been  corrected  by  program  modification.)   The 
problem  in  our  case  is  running  over  estimated  time,  but  the  end  re- 
sult, inability  to  complete  the  search,  is  the  same. 

To  date  we  have  generally  used  batches  of  ten  to  fifteen  search- 
es, but  we  have  gone  as  high  as  thirty.    Processing  a  batch  through 
the  entire  file  generally  takes  from  four  and  one  half  to  five  hours. 
The  length  of  time  for  HSS  depends  primarily  on  the  size  of  the  file, 
not  the  size  of  the  batch.    The  only  batch  factor  of  importance  is  the 
proportion  of  significant  author  elements  to  significant  descriptor 
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headings.    Because  authors  are  in  English,  and  more  characters  are 
involved,  searching  them  takes  longer.    It  has  consistently  been  in 
the  range  of  1,400  to  2,000  citations  per  minute,  with  the  average 
usually  in  the  lower  end  of  the  range.    Because  of  the  relatively  limit- 
ed variation  in  time,  an  experiment  was  carried  on  with  the  30  batch 
search.    The  Intermediate  Retrieved  Citation  File  generated  in  HSS 
was  used  as  a  selected  CCF.    The  batch  was  broken  into  a  group  of 
smaller  batches,  each  of  which  was  run  in  HSS  against  the  selected 
CCF,  then,  as  time  was  available,  through  Log  Search  and  Report 
Generator  where  time  used  is  a  direct  expression  of  the  number  of 
citations  retrieved  to  satisfy  the  search  request  statement.    This  pro- 
cedure made  it  possible  to  submit  some  computer  priority  2  and  3 
jobs  and  thus  to  obtain  results  much  earlier  than  is  possible  with 
priority  4  jobs.    In  addition  to  this  bonus,  batch  segmentation  makes 
MEDLARS  jobs  less  liable  to  blow  up  because  the  probability  of  en- 
countering errors  is  smaller.    Moreover,  if  there  is  a  blow-up,  the 
penalty  in  time  lost  is  not  so  great,  although  the  total  amount  of  ma- 
chine time  used  by  this  type  of  batching  is  greater  than  the  more 
direct  procedure  providing  the  latter  does  not  run  into  tape  read 
errors  or  other  trouble. 

These  may  very  well  prove  important  considerations  for  routine 
operations;  they  are  of  particular  value  during  the  developmental 
period  when  search  failure  and  search  completion  are  in  decidedly 
unfavorable  balance.    A  multiplicity  of  problems  is  normal  for  any 
volume  testing  period,  but  in  our  case  they  have  been  compounded  by 
the  length  of  the  MEDLARS  file  and  the  sophistication  of  the  computer 
configuration.    On  the  first  score,  for  example,  MEDLARS  already 
has  well  over  300,000  records  and  the  maximum  acceptable  margin 
of  error  on  HSCF's  IBM  7040-7094  system  is  one  error  in  300,000 
records,  or  a  mean  free  error  time  of  forty  minutes.    On  the  second 
score,  running  time  varies  for  a  MEDLARS  job  in  Edit  or  Report 
Generator,  where  input -output  is  an  important  factor,  with  the  nature 
of  other  jobs  going  through  the  IBM  7040.    The  Edit  routine,  for  ex- 
ample, has  varied  from  3  to  5  minutes  on  identical  search  batches  at 
different  times.    Another  problem  is  that  the  HSCF  computer  system, 
fails  to  skip  over  tape  errors  and  instead  throws  out  the  search. 
This  procedure  is  an  advantage  for  mathematical  computations  since 
an  error  anywhere  along  the  way  invalidates  the  results,  but  in 
library-type  manipulations  it  has  a  high  cost  in  lost  time.    If  this 
were  changed,  the  error  could  be  identified  by  manual  examination  in 
most  cases  and  the  printout  corrected  manually  also.    In  any  case,  all 
the  searches  could  then  be  rerun  as  part  of  another  batch. 

The  reprogramming  called  for  in  the  first  contract  was  largely 
completed  by  November  1965.    The  first  testing  of  the  system  as  a 
whole  was  done  by  running  parallel  searches  with  a  group  which  had 
already  been  done  at  NLM  and  the  Colorado  station  against  the  UCLA 
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programs.    Retrieval  proved  to  be  identical.    We  plan  to  continue 
comparisons  periodically  and  to  ask  NLM  to  run  a  sampling  of  UCLA 
searches. 

As  of  April  15,  1966,  the  Station  has  accepted  93  original 
searches  of  which  32  have  been  completely  processed  and  returned 
to  the  requestors.    Requestor  reaction,  searcher  check,  and  evaluation 
of  results  have  all  been  reassuring.    The  other  searches  have  all  been 
formulated  and  are  in  various  stages  of  machine  search,  most  of  them 
at  Report  Generator  whose  programs  have  been  the  last  to  receive  a 
thorough  working  over.    Once  the  Station  is  operating  routinely  and 
well,  requests  should  be  processed  in  two  weeks,  with  emergency 
service  in  one,  providing  machine  time  is  available  and  search  vol- 
ume within  reason,  but  we  cannot  yet  predict  how  long  it  will  be  be- 
fore this  happy  state  of  affairs  is  at  hand. 

Statistics  are  collected  from  all  batches  on  the  number  of 
searches  in  the  batch,  the  machine  time  for  Edit,  HSS,  Log  Search 
and  Report  Generator,  the  number  of  citations  searched,  retrieved 
and  printed,  the  number  of  significant  elements,  and  the  number  of 
citations  printed  with  tracings  or  groups.    The  latter  and  the  number 
of  citations  printed  are  also  kept  for  individual  searches  as  is  the 
processing  time  from  arrival  of  request  to  return  of  the  MEDLARS 
produced  bibliography.    However,  the  small  volume  handled  thus  far 
does  not  lend  itself  to  support  of  any  generalizations,  and  the  stop 
and  repeat  manner  in  which  searching  has  proceeded  in  the  test  peri- 
od makes  figures  for  total  length  of  time  to  process  requests  mean- 
ingless. 

Timing  comparisons  between  the  COBOL  and  the  NLM  machine 
programs  are  of  special  interest,  but  there  are  as  yet  no  reliable 
data  on  which  to  base  them.    There  is  some  indication  that  the  NLM 
programs  may  be  nearly  three  times  faster  in  HSS  and  twice  as  fast 
in  Log  Search.    Transfer  to  the  IBM  360  System  should  improve 
present  timing  considerably,  perhaps  by  a  factor  of  one,  but  it  is  now 
obvious  that  for  stations  running  on  IBM  7090  or  7040  series  equip- 
ment, the  search  routines  must  be  returned  to  machine  language. 

Change  and  experiment  are  certain  to  be  the  most  constant 
characteristics  of  the  UCLA  Station  for  the  next  several  months,  in 
fact  for  the  next  year,  but  our  experience  to  date  should  greatly  fa- 
cilitate the  tooling  up  period  for  the  three  new  stations  now  in  process 
of  organizing. 

Brain  Information  Service 

The  UCLA  MEDLARS  Station  has  also  made  a  substantial  con- 
tribution to  the  Brain  Information  Service  (BIS)  which  was  established 
at  about  the  same  time  in  1964  under  contract  with  the  National 
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Institute  of  Neurological  Diseases  and  Blindness.    Its  purpose  is  to 
provide  information  in  the  basic  brain  sciences  rapidly,  accurately, 
and  completely  to  investigators  and  teachers  in  this  field.    The  sub- 
ject area  of  the  Service  includes  neuroanatomy,  neurophysiology, 
neuroendocrinology,  neurochemistry,  neuropharmacology,  and  non- 
clinical  aspects  of  neuropathology.    The  literature  of  diagnosis  and 
treatment  of  neurological  diseases  is  not  within  its  scope. 6 

For  convenience,  the  Service  is  referred  to  in  this  paper  as  a 
Biomedical  Library  project,  but  it  is  only  partly  so  as  the  organiza- 
tion chart  in  Figure  13  indicates.     The  Scientific  Unit  is  responsible 
for  conferences  and  workshops,  stimulation  of  the  writing  of  critical 
reviews,  a  study  of  the  information  gathering  habits  of  scientists 
working  in  the  brain  sciences,  and  for  formation  of  the  glossary 
which  is  the  vocabulary  used  by  the  computer  in  searching.    The 
Bibliographic  Unit  is  responsible  for  input  for  the  computer  store 
and  for  the  search  and  current  alerting  service,  both  manual  and 
automated.    At  the  present  time  only  manual  service  is  available,  and 
it  is  limited  largely  to  members  of  the  UCLA  Brain  Research  Insti- 
tute who  serve  as  a  users  test  group.    On  request  several  investi- 
gators in  other  California  institutions  and  in  other  parts  of  the  United 
States,  as  well  as  a  few  abroad,  have  been  added  to  the  group.    Toward 
the  end  of  1966  when  the  computer  programs  will  have  been  thoroughly 
tested  and  the  glossary  more  adequately  developed,  the  Service  will 
be  opened  on  a  national  scale.    The  programs  are  all  written  except 
for  the  Report  Generator  and  Publication  Modules  which  are  due  next 
month  and  a  glossary  maintenance  program  which  will  not  be  ready 
for  some  months  yet. 

BIS  uses  the  same  equipment  in  the  Health  Sciences  Computing 
Facility  as  does  MEDLARS  and  also  has  handled  its  initial  program- 
ming through  subcontracts  with  the  Planning  Research  Corporation. 
Every  effort  has  been  made  to  secure  compatibility  between  the  two 
projects  where  this  is  practical  and  to  keep  personnel  on  one  informed 
of  the  operations  of  the  other  and  vice  versa.    The  BIS  programs 
have  been  written  in  COBOL,  and  their  modifications  for  the  new  IBM 
360  System  are  being  planned  jointly  with  those  for  the  MEDLARS 
Station  with  the  help  of  the  Computing  Facility. 

In  addition  to  MEDLARS-type  demand  search  routines,  BIS 
programs  fall  into  two  groups:    input  and  glossary.    Their  end  prod- 
ucts produce  a  master  citation  file  which  may  be  searched  with  the 
same  type  of  formulation  as  is  employed  for  MEDLARS  although  de- 
scriptors are  drawn  from  a  quite  different  type  of  vocabulary.    The 
vocabulary  will  be  described  along  with  the  glossary  programs. 

The  sources  of  input  for  the  BIS  data  base  are  shown  in  Figure 
14.    The  first  is  the  magnetic  tape  which  brings  a  cut  of  the  UCLA 
MEDLARS  file  drawn  off  by  recurrent  demand  search,  beginning  with 
the  1964/65  file,  monthly  thereafter.    Next  are  citations  on  punched 
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cards  from  the  retrospective  literature  which  includes  a  selection 
from  the  Cumulated  Index  Medicus,  1960-1963,  and  from  bibliogra- 
phies compiled  manually  by  subject  area  specialists  in  the  Bibliogra- 
phic Unit.    A  selection  from  other  sources  such  as  Biological  Ab- 
stracts and  the  Current  List  of  Medical  Literature  will  be  added  later 
as  time  permits.    Citations  from  a  small  core  of  the  most  important 
current  journals,  which  are  required  for  the  system  as  soon  as  pub- 
lished, as  well  as  a  group  of  biological  journals  not  included  in 
MEDLARS,  are  indexed  by  the  area  specialists  to  form  a  third  type 
of  input.    Current  symposia,  conference  proceedings,  and  other  multi- 
author  volumes  indexed  by  the  area  specialists  form  a  fourth  input. 
Overlapping  between  citations  from  the  MEDLARS  system  and  other 
input  sources  is  deleted  by  the  merge  program  from  which  the  BIS 
Compressed  Citation  File  is  generated.    Figure  15  gives  an  estimate 
of  the  size  of  representation  of  the  various  types  of  input  in  the  file. 
A  very  important  component  not  shown  is  the  backlog  of  retrospective 
citations  still  to  be  keypunched  from  the  Cumulated  Index  Medicus 
1960-1963,  which  has  been  only  60  to  65  percent  completed,  and  from 
other  bibliographies.    Information  on  research  in  progress  in  some 
fields  is  being  received  from  the  Science  Information  Exchange,  but 
plans  have  not  yet  been  developed  for  incorporation  of  this  infor- 
mation into  the  computerized  system. 

The  citation  entry  record  (see  Figure  16  a-c)  is  arranged  for 
easy  transference  to  keypunch  cards.    It  was  designed  to  be  as  flex- 
ible as  practical  and  to  include  space  for  certain  features  which  would 
be  desirable  to  add  at  some  future  time  such  as  indexing  values  for 
terms  and  relationships  for  terms.    (See  Figure  16a,  Format,  Col- 
umns 73-80.)   Incidentally,  the  word  "Link"  as  used  on  this  form 
means  simply  that  cards  and  columns  are  linked  in  the  case  of  run- 
over  in  number  needed  (see  Figure  16b).    Role  indication  columns  at 
present  are  used  only  to  show  the  source  of  the  term. 

The  "10"  cards  give  all  the  usual  bibliographical  information 
about  the  citation,  name  of  indexer,  sequence  number  for  citation, 
and  source  of  citation.    The  "20"  cards  are  subject  term  cards.    Ab- 
stracts, when  included,  go  on  "30"  cards,  and  authors  and  editors  go 
on  "40"  cards.    Abstracts  may  consist  of  editorially  selected  portions 
of  the  text  of  the  article.    All  authors,  regardless  of  number,  are 
punched  and,  when  the  citation  is  retrieved,  printed  out.    Titles  in 
English,  both  original  and  translated,  go  on  "50"  cards,  and  titles  in 
the  vernacular  or  transliterated  go  on  "60"  cards.    There  are  as  yet 
no  "70"  cards.    The  "80"  cards  are  for  the  senior  author's  address, 
and  the  "90"  cards  are  for  the  names  of  the  scientists  who  have  re- 
quested information  on  the  subject  of  the  article.    ("90"  card  infor- 
mation is  gathered  for  a  small  separate  program  which  is  part  of  a 
current  alerting  service.)   Sort  criteria  are  journal  title  code,  volume 
number,  year,  starting  page,  and  senior  author.    All  five  must  be 
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punched  in  order  to  add,  delete,  or  update  a  citation  because  these 
are  also  the  merge  criteria.    In  the  case  of  chapters  in  multi-author 
volumes,  (BK)  is  used  for  journal  title  code  and  zeros  wherever  else 
something  non-existent  in  the  citation  must  be  supplied. 

Programs  convert  the  various  inputs  to  the  standard  tape  for- 
mat for  which  the  citation  entry  record  was  designed,  merge  identical 
citations  into  one  record  which  contains  all  information  from  all 
sources,  and  extract  words  from  titles  for  mechanical  indexing  of 
citations.    (See  Figure  17.)    The  extracted  title  words  and  all  other 
descriptors  attached  to  the  citation  are  then  matched  against  the 
glossary  tape.    If  the  words  are  recognized  as  indexable,  they  are 
added  to  the  citation  as  coded  tags.    If  the  words  prove  to  be  syno- 
nyms, the  terms  with  which  they  are  synonymous  are  substituted  and 
assigned  to  the  citation,  but  if  they  are  found  to  be  non-indexable,  no 
further  action  is  taken.    At  present  we  are  entering  only  English 
words  as  potential  subject  headings,  but  computer  routines  allow  any 
Roman  alphabet  language  to  be  used.    Words  not  found  in  the  glossary 
are  printed  out  as  warnings  to  the  glossary  editor  who  must  then 
make  the  decision  on  the  status  to  be  given  the  new  word  in  the  glos- 
sary:   indexable  as  subject  heading,  indexable  as  synonym,  or  non- 
indexable.    Once  this  is  done,  a  second  run  can  be  made  for  the  pur- 
pose of  attaching  the  new  terms  to  the  citation.    The  updated  master 
citation  file  is  then  ready  for  searching  by  MEDLARS  strategies. 

Like  citations  in  the  master  file,  terms  in  the  glossary  are 
drawn  from  MeSH,  whose  descriptors  are  appended  to  the  citations 
from  MEDLARS,  from  the  subject  headings  used  in  Index  Medicus 
1960-1963,  from  words  assigned  by  BIS  area  specialists  who  double 
as  indexers,  and  from  words  in  titles.    Indexers  assign  terms  from 
the  text  of  the  article,  generally  without  reference  to  the  BIS  glos- 
sary.   There  is  no  limit  to  the  number  of  words  in  a  term  assigned 
by  indexers  except  that  potential  descriptors  in  the  glossary  are 
limited  to  forty-eight  characters  and/or  spaces.? 

As  previously  noted,  all  citations  entering  the  system  from  any 
source  are  mechanically  indexed  from  words  occurring  in  their  titles. 
Terms  are  also  drawn  mechanically  from  abstracts  introduced  on 
BIS  citation  entry  records.    These  terms  consist  of  every  single  word 
and  every  two  words  in  sequence  except  for  a  list  of  127  words  which 
are  inadmissable,  such  as  A,  ALSO,  BEFORE,  IN,  IS,  OF,  STUDIES, 
THE,  TO,  UPON,  USING,  WHEN,  and  WITH.    These  are  mechanically 
rejected  on  examination  by  the  programs.    In  the  title  "Electroenceph- 
alographic  studies  in  experimental  pulmonary  oedema"  the  potential 
terms  are  ELECTROENCEPHALOGRAPHIC,  EXPERIMENTAL, 
PULMONARY,  OEDEMA,  EXPERIMENTAL  PULMONARY,  and  PUL- 
MONARY OEDEMA.    Routines  do  not  assign  a  single  word  descriptor 
to  an  article  when  that  word  also  occurs  in  a  two-word  sequence, 
thus,  PULMONARY  OEDEMA  but  not  PULMONARY  alone  or  OEDEMA. 
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If  this  restriction  proves  inappropriate,  it  can,  of  course,  be  changed 
later.    In  the  mechanical  indexing  program,  more  than  two  terms  in 
sequence  are  inadmissable.    Thus  for  CIRCLE  OF  WILLIS,  only  the 
single  words  CIRCLE  and  WILLIS  will  be  picked  up. 

The  BIS  glossary  is  an  open-ended  vocabulary  based  on  natural 
language.    Control  is  provided  by  the  glossary  editor  who  is  responsi- 
ble not  only  for  decisions  on  which  terms  will  be  added  as  indexable 
and  which  declared  non- indexable,  but  also  relates  each  indexable 
term  to  its  generically  more  general  and  more  specific  terms. 

Procedures  in  glossary  formation  are  illustrated  in  Figures  18- 
20.    Figure  18  is  a  printout  from  the  alphabetical  section  of  the  glos- 
sary (which  in  its  computerized  form  bears  the  unfortunate  acronym 
of  BISH  for  Brain  Information  Subject  Headings).    The  numbers  in 
the  first  column  at  the  left  are  codes  for  all  indexable  terms  whether 
subject  headings  or  synonyms.    Non-indexable  terms  are  indicated  by 
a  zero.    Terms  with  the  same  numbers  make  up  a  synonym  set.    Li 
the  next  column,  the  figure  1  indicates  both  a  subject  heading  and  a 
non-indexable  term,  2  a  synonym.    Under  "tag  source"  is  the  letter  I 
used  for  an  indexable  MeSH  or  non-MeSH  heading,  M  for  a  non- 
indexable  MeSH  term  and  N  for  a  non-indexable  non-MeSH  term.    All 
MeSH  terms  will  become  indexable  as  soon  as  citations  using  them 
enter  the  computerized  system.    They  are  entered  in  the  glossary  as 
unindexable  now  for  convenience  of  the  editor  who  will  not  place  them 
in  the  hierarchy  until  they  appear  for  the  first  time  on  citations.    The 
"frequency  column"  is  used  to  show  the  number  of  times  a  synonym 
set  has  been  assigned  to  a  citation.    In  this  case  all  are  zeros  because 
only  an  initial  small  group  of  citations  had  been  entered  into  the  sys- 
tem at  the  time  of  the  printout,  and  the  second  run  for  the  glossary 
update  had  not  yet  been  made.    The  "specificity  column"  will  not  be 
used  until  later.    Then  it  will  give  an  index  value  for  the  term,  prob- 
ably 1,  2,  or  3  representing  general,  less  general,  and  least  general; 
or  perhaps  just  1  and  2,  general  and  specific.    The  index  value,  in 
turn,  will  be  used  to  review  the  specificity  with  which  citations  have 
been  indexed.    All  citations  whose  subject  tags  total  less  than  a  given 
value  will  require  further  indexing.    Details  are  yet  to  be  planned.    In 
reviewing,  for  example,  it  may  prove  more  effective  to  omit  the  value 
number  for  general  from  the  total  count  for  the  citation  since  the 
purpose  of  the  review  is  to  increase  the  specificity.    Since  many  ci- 
tations will  enter  the  system  with  subject  terms  drawn  only  from 
mechanical  extraction  from  titles,  a  review  of  indexing  effectiveness 
is  essential.    The  "main  heading  column"  lists  all  the  descriptors  in 
alphabetical  order.    Note  in  this  column  the  terms  coded  36120.    These 
are  part  of  a  synonym  set,  with  the  term  electroencephalography  indi- 
cated as  the  preferred  heading,  the  other  seven  terms  as  synonyms. 
Members  of  a  synonym  set  may,  and  usually  are,  separated  in  the 
alphabetical  listing.    In  the  Numerical  BISH  (see  Figure  19)  into 
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which  the  Alphabetical  BISH  is  sorted  for  the  purpose  of  attaching 
subject  headings  in  code  to  citations,  the  order  is  numerical,  and  all 
synonyms  are  brought  together  without  regard  to  alphabetical  order. 
Thus  the  complete  synonym  set  for  electroencephalography  can  now 
be  seen  to  consist  of  fifteen  members.    More  may  be  added  later  as 
the  glossary  expands. 

Figure  20  shows  warnings  for  two  citations  from  the  system  to 
the  editor.    It  presents  him  with  eight  terms  on  which  he  must  make 
decisions  for  the  glossary.    The  other  fourteen  terms  are  already  in 
the  glossary  as  either  indexable  or  non-indexable  terms. 

Figure  21  illustrates  the  hierarchical  features  of  the  glossary. 
At  present  the  hierarchy  is  maintained  clerically  and  used  chiefly  to 
locate  synonym  sets  in  the  glossary  when  new  terms  are  under  con- 
sideration for  inclusion  in  a  given  set.    The  order  is  not  necessarily 
one  which  is  appropriate  for  explosion  of  terms  in  a  search;  in  many 
cases  the  editor  will  later  rearrange  it  for  more  effective  explosion 
of  general  terms.    Moreover,  there  are  as  yet  no  means  for  mechani- 
cally exploding  terms,  although  a  program  is  planned  to  provide  this 
capability  and  to  make  it  possible  to  enter  for  retrieval  at  any  level 
rather  than  to  follow  the  tree  structure  pattern  in  MEDLARS  which 
allows  only  four  levels  of  specificity. 7 

A  glossary  maintenance  program  which  will  eliminate  a  good 
part  of  the  clerical  upkeep  for  building  the  glossary  and  will  provide 
a  more  rigorous  check  for  errors  is  expected  to  be  ready  soon  for 
modification  by  BIS.    The  program  will  replace  the  checking  for  lo- 
cations in  the  hierarchy  and  the  adding  of  synonyms  manually  as  well 
as  the  manual  insertion  and  positioning  of  preferred  new  subject 
headings.    This  program  has  been  developed  by  IBM  for  Dr.  Baldwin 
Lamson's  research  at  UCLA  on  recording  and  retrieving  clinical 
pathological  records  in  natural  language. 

The  routines  used  for  development  of  the  glossary  will  obvious- 
ly increase  its  size  very  rapidly  as  the  first  citations  are  entered 
into  the  system.    They  will  also  produce  a  very  large  indexing  vo- 
cabulary whose  rate  of  increase  per  entered  citation  will,  however, 
eventually  be  quite  low  because  most  of  the  potential  descriptors  will 
already  be  in  the  glossary.    As  of  March  1,  1966,  the  glossary  con- 
tained over  6,000  indexable  terms  in  about  5,000  synonym  sets  and 
1,100  non-indexable  terms.    It  does  not  appear  that  the  size  of  the 
glossary  will  have  a  significant  effect  on  machine  time  required  to 
operate  the  system.    Initial  experience  is  that  approximately  twenty 
minutes,  including  time  for  input  from  card  to  tape,  are  required  to 
take  the  terms  from  a  batch  of  200  citations.    The  major  segment  of 
time  for  this  routine  is  used  in  sorting  single  and  double  terms  into 
alphabetical  sequence  after  the  trivial  words  are  removed.    A  maxi- 
mum of  600  synonyms  can  be  added  at  one  time,  but  there  is  no  limit 
on  the  number  of  preferred  subject  headings  which  may  go  in 
together.'' 
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The  size  of  the  glossary  and  its  continuous  growth  obviously 
complicate  glossary  maintenance  and  use,  but  the  glossary  has  ad- 
vantages which  we  hope  will  more  than  compensate  for  its  disadvan- 
tages.   One  of  these  is  mechanical  indexing  from  titles,  a  method 
which  is  very  cheap  indeed  for  it  involves  no  indexer  time  at  all  and 
no  extra  keypunching.    On  the  other  hand,  it  does  not  generally  pro- 
vide depth  adequate  for  needs  of  a  service  such  as  the  BIS.    However, 
applied  to  abstracts  this  method  is  likely  to  prove  very  valuable  and 
involves  little  more  than  the  additional  cost  of  keypunching  the  ab- 
stracts at  the  time  they  are  added  to  the  citation  entry  record.    The 
cost  of  uniformly  adding  abstracts  to  the  system  is,  of  course,  another 
thing  apart  from  their  use  for  indexing.    In  any  case,  it  has  never 
been  the  plan  to  rely  entirely  or  even  heavily  on  mechanical  indexing, 
but  to  use  it  as  a  method  to  build  up  a  file  quickly  and  passably  well. 
The  screening  for  specificity  of  indexing  referred  to  earlier  will 
make  it  possible  to  bring  all  citations  up  to  about  the  same  level  of 
indexing  eventually.    Meanwhile  the  mechanical  indexing  serves  as  a 
very  useful  supplement  to  what  is  brought  in  from  MEDLARS  indexing 
and  BIS  staff  indexing. 

Another  advantage  is  that  human  indexing  can  be  done  less  la- 
boriously with  this  type  of  glossary  than  with  a  rigidly  controlled 
vocabulary.    Since  words  from  texts  are  what  BIS  indexer s  are  asked 
to  use  when  possible,  the  indexer  need  not  look  up  terms  but  can  sub- 
stitute underlined  text  for  filled  out  indexing  sheets.    Further,  intelli- 
gent but  unskilled  indexers  can  perhaps  be  utilized  for  some  of  the 
indexing  because  it  is  generally  evident  from  the  syntax  which  de- 
scriptors should  be  chosen  from  the  text,  even  if  the  indexer  does  not 
know  the  meaning  of  a  particular  word  and  does  not  look  it  up.    For 
example,  the  term  GUANA ZALO  (an  antineoplastic  purine  analogue) 
will  be  placed  in  its  synonym  set  under  the  preferred  heading  8- 
AZAGUANINE  by  the  editor  the  first  time  it  occurs  and  mechanically 
thereafter.    The  intellectual  effort  required  to  relate  the  two  terms 
is  thus  expended  only  once.? 

There  is  expectation,  too,  at  least  among  the  scientific  staff, 
that  consistency  of  indexing  will  be  higher  with  this  open  type  of  vo- 
cabulary than  with  a  controlled  vocabulary  because  it  is  assumed  that 
"authors  are  more  consistent  among  themselves  in  their  use  of 
technical  terms  than  indexers  are  in  relating  authors'  technical 
terms  to  an  approved  vocabulary. "7    Moreover,  indexing  with  this 
type  of  vocabulary  should  not  only,  on  the  whole,  produce  greater 
specificity  but  should  also  reflect  new  subjects  and  new  terms  in  the 
literature  as  soon  as  citations  are  added  to  the  data  base. 

MEDLARS  is  a  general  service  for  the  biomedical  community 
and  thus  a  first  line  support  for  the  highly  specialized  Brain  Infor- 
mation Service.    BIS  will  handle  more  kinds  of  information,  but  its 
total  file  will  be  far  smaller.    While  the  MEDLARS  Station  looks  to 
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NLM  for  future  direction  and  development,  BIS  must  look  to  itself  but 
take  full  advantage  of  whatever  it  can  usefully  incorporate  from 
MEDLARS  and  from  the  other  neurological  information  centers  with 
which  it  works  in  close  cooperation. 


Grateful  acknowledgment  is  made  to  Mr.  Robert  Braude,  Head 
Searcher  for  the  UCLA  MEDLARS  Station,  Mrs.  Pat  Walter,  Head  of 
the  Bibliographic  Unit  of  BIS,  and  Dr.  Peter  Amacher,  Editor  of  the 
Glossary,  for  advice  and  information  for  this  paper. 
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Figure  2 

Organization  of  UCLA  Search  Station  and  its  Relation  to  other  Sta- 
tions and  the  National  Library  of  Medicine. 
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University  of  California,  Los  Angeles 
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MEDLARS  Search  Station 
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Figure  3 
Search  Request  Form,  UCLA  MEDLARS  Station. 
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Demand  Search  Formulation  Record 


110 


Request  number: 
1  6 
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Figure  5 

Verso  of  Demand  Search  Formulation  Record  showing  the  Request  Element 

Definition  List. 
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Figure  6 

Flow  chart  for  Preliminary  Edit  Program  used  in  UCLA  MEDLARS 

Search  Station. 
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EDIT 


Figure  7 
Flow  Chart  showing  Search  and  Report  routines,  UCLA  MEDLARS  Station. 
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Los  Angeles:  Biomedical  Library 

The  bibliography  which  you  have  received  is  a  product  of  MEDLARS, 
a  computer-based  information  storage  and  retrieval  system.  While 
the  system  is  in  its  developmental  phase  all  information  concern- 
ing the  operation  is  of  value  to  its  future  success. 

As  a  user  of  MEDLARS  you  are  in  a  position  to  provide  both  general 
and  specific  criticism  of  the  systems  output.  Since  this  is  the 
area  of  most  vital  interest  to  you,  we  should  greatly  appreciate 
your  comments  on  the  attached  appraisal  form. 

Although  this  form  lists  certain  major  areas  which  can  be  evaluated 
by  checking  the  appropriate  box,  we  have  left  space  for  more  general 
comments,  and  we  hope  that  you  will  consider  giving  us  the  fullest 
evaluation  possible. 

Again  we  should  like  to  stress  that  you,  the  user,  can  provide  the 
most  important  feedback  to  the  system;  this  is  what  we  particularly 
need  now  during  the  developmental  stage. 


Thank  you  for  your  cooperation. 


MEDLARS  Search  Station 
Biomedical  Library 
University  of  California, 
Los  Angeles 


Figure  11 
Request  for  Evaluation  of  a  MEDLARS  Search. 
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SEARCH  NUMBER 
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UNIVERSITY  OF  CALIFORNIA,  LOS  ANGELES 

BIOMEDICAL  LIBRARY 
MEDLARS  Search  Station 

Appraisal  of  MEDLARS  Search 
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citations,   (e.g.,  descriptors  used,  formulation  of  search  for  retrieval) 


Timeliness 

Did  you  receive  the  citations  in  time  to  be  of  use?  .  Do  you 

have  any  comments  on  timeliness? 


General  Comments  (Use  reverse  side  if  necessary) 


Figure  12 
Form  for  Appraisal  of  a  MEDLARS  Search. 
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Input  for  Brain  Information  Service  Computer  Store 
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Size  and  Growth  of  Brain  Information  Service  Computer  Store 
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KONISMUK  BR 
SAHYER  CH 


INDEXED  UNDER  TERM  FROM  TERM  CARD  —  152450, 


CLEMENTE    C 


THE    MESSAGES    BELON   REFER    TO    OECKLET    —   066040600602 
(ELECTRICAL    RESPONSES    TO   VISUAL    STIMULATION    IN    MIOBRAINi 


LLA    OBLONGATA    AND   SPINAL    CORD   OF    THE    LAMPREY) 


THIS  TERM  FROM 

TITL  CARD  IS 

NOT 

IN 

THE  8ISH 

—  ELECTRICAL  RESPONSES 

THIS  TERM  FROM 
THIS  TERM  FROM 

TITL  CARD  IS 
TERM  CARD  IS 

NOT 
NOT 

IN 
IN 

THE  BISH 
THE  BISH 

~  LAMPREY 
—  LATENCY  PEKIOO 

THIS  TERM  FROM 
THIS  TERM  FROM 

TERM  CARD  IS 
TITL  CARD  IS 

NOT 
NOT 

IN 
IN 

THE  BISH 
THt  BISH 

—  LIGHT  STIMULATION 
—  MICBRAIN 

THIS  TERM  FROM 
THIS  TERM  FROM 

TERM  CARD  IS 
TITL  CARD  IS 

NOT 
NOT 

IN 

IN 

THE  dISH 

THE  BISH 

—  MOTOR  EFFERENT  PATHWAYS 
—  VISUAL 

THIS  TERM  FROM 
INDEXED  UNDER 

T 

TITL  CARD  IS 
:RM  FROM  TITL 

NOT 
CARD 

IN 

THE  BISH 
069363, 

—  VISUAL  STIMULATION 
MEDULLA  OBLONGATA 

INDEXED  UNDER 
INDEXED  UNDER 

TERM  FROM  TERM 
TERM  FROM  TITL 

CARD 
CAPO 

~~ 

080934, 
106785, 

OPTIC  NkRVE 
SPINAL  CORD 

INDEXED  UNDER 
INDEXED  i)NOER 

TERM  FROM  TITL 
TERM  FROM  TITL 

CARD 
CARD 

__ 

150819, 
151925, 

ELECTRICAL 
STIMULATION 

INDEXED  UNDER 
INDEXED  UNDER 

TERM  FROM  TERM 
TERM  FROM  TERM 

CARD 
CARD 

__ 

151990, 

152411, 

TEG  ME  NT  UM 
HAYMARD  J 

THE    HESSAGFS    BELOW    REFER    TO    DECKLET    —    066040600702 


(EFFECT    OF    PANCRF.ATECTOMY    AND    ACETYLCHOL INE    ADMINISTRATION    ON 
ELECTRICAL    ACTIVITY    OF    MUSCLE    AND    FUNCTION    OF    THE    NERVE-MUSC 


LE    JUNCTION) 


THIS 

TERM 

FROM 

TITL 

CAHO 

IS 

NOT 

IN 

THE 

BISH    - 

- 

ACETYLCHOLINE    ADHI  NI  STRATION 

THIS 

THIS 

TERM 
TERM 

FROM 
FROM 

TERM 
TITL 

CARD 
CARD 

IS 
IS 

NOT 
NOT 

IN 
IN 

THE 

THE 

BISH   - 
BISH   - 

~ 

CONTRALATERAL    INFLUENCE 
JUNCTION 

THIS 
THIS 

TERM 

TERM 

FROM 
FROM 

TERM 

TERM 

CARD 
CARD 

IS 
IS 

NOT 
NOT 

IN 
IN 

THE 
THE 

BISH   - 
BISH   - 

_ 

LATENCY    PERIOD 

MOTOR   NERVE 

THIS 

THIS 

TERM 
TERM 

FROM 
FROM 

TITL 
TITL 

CARD 
CARD 

IS 
IS 

NOT 
NOT 

I  N 
IN 

THE 
THE 

BISH   - 

BISH    - 

~ 

NERVE-MUSCLE 
NERVE-MUSCLE    JUNCTION 

THIS 
THIS 

TERM 
MESH 

FROM 
TERM 

TERM 

ROM    S 

CARD 
TITL 

IS   NOT 
CARD    IS 

IN 
IN 

THE 
THE 

BISH   - 

BISH 

-    NEUROMUSCULAR    APPARATUS 
FILE    AS    NON-INDEXABLE    —    PANCREATECTOMY 

THIS 
THIS 

TERM 
TERM 

FROM 
FROM 

TERM 
TERM 

CARD 
CARD 

IS 

IS 

NOT 

NOT 

IN 

IN 

THE 

THE 

BISH    - 
BISH   - 

_ 

SYSTEMATIC    ACETYLCHOLINE    ADMINISTRATION 

TROPHIC    FACTOR 

INDEXED  UNDER  TERM  FROM  TITL  CARD 
INDEXED  UNDER  TERM  FROM  TITL  CARD 


001134, 
036309, 


ACETYLCHOLINE 
ELECTRICAL  ACTIVITY 


INDEXED  UNDER  TERM  FROM  TERM  CARD 
INDEXED  UNDER  TERM  FROM  TITL  CARD 


044709, 
074256, 


FROG 
MUSCLE 


INDEXED  UNDER  TF»M  FPOM  TITL  CARD 
INDEXED  UNDER  TERM  FPOM  TERM  CARD 


075054, 
150043. 


MYONEURAL    JUNCTION 
ACTION    POTENTIAL 


150928, 
151818, 


FUNCTION 

RESTING    POTENTIAL 


THE    MESSAGFS    BFLOW    RFFEP    TO    OECKLET    —    066040600802 


Figure  20 

Print-out  of  Warnings  and  Other  Messages  from  the  System  to  the  BIS  Glos- 
sary Editor. 
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A2B 
ELECTRICAL  RECORDINGS  FROM  BRAIN  (See  also  BRAIN  ELECTROPHYSIOLOGY  6> 

EEC  WAVES,  ACTIVITIES,  RHYTHMS  AND  COMPLEXES 
SLOW- WAVE  EEC 

DELTA  ACTIVITY 

DELTA  RHYTHM 

DELTA  WAVE 

THETA  ACTIVITY 

THETA  RHYTHM  (Theta-rhythm 
THETA  WAVE 

ALPHA  ACTIVITY  (See  also  ALPHA  ATTENUATION) 
ALPHA  RHYTHM  (Alpha-rhythm 

ALPHA  WAVE 
BETA  ACTIVITY 

BETA  RHYTHM 

BETA  WAVE 

GAMMA  ACTIVITY 

GAMMA  RHYTHM 

GAMMA  WAVE 

Figure  21 
Table  Showing  a  Section  from  the  BIS  Glossary  Hierarchy. 
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OPERATIONAL  COMPUTER-BASED 
SYSTEMS  AT  THE  CHEMICAL  ABSTRACTS  SERVICE 


Seldon  W.  Terrant,  Jr.,  and  J.  L.  Wood 


The  American  Chemical  Society  (ACS)  is  a  non-profit  organi- 
zation chartered  by  Congress  as  a  scientific  and  educational  organi- 
zation.   The  Chemical  Abstracts  Service  (CAS)  is  a  division  of  the 
American  Chemical  Society. 

Like  any  society  the  ACS  is  concerned  with  helping  its  members 
to  get  their  papers  published  and  to  help  chemists  and  chemical  engi- 
neers to  gain  recognition  in  the  scientific  and  general  public  com- 
munities.   Equally  important,  however,  the  ACS  wants  to  have  avail- 
able information  put  to  use.    To  this  end,  it  is  concerned  that  all 
forms  of  research,  development,  and  applied  technology  be  fully  re- 
corded, indexed,  and  retrievable  for  the  full  scientific  community. 
Chemical  information  is  not  used  only  by  chemists  and  chemical  engi- 
neers.   It  is  used  by  many  other  kinds  of  scientists.    In  line  then  with 
the  major  purpose  of  getting  available  information  put  to  use,  it  is 
clear  that  the  ACS  operates  the  Chemical  Abstracts  Service  not  only 
for  the  benefit  of  members  of  the  Society,  but  for  a  broader  reason. 
We  want  chemical  and  chemical  engineering  information  put  to  use  in 
the  full  scientific  community  so  that  the  general  public  will  reap  the 
benefits  which  such  use  will  provide. 

When  the  full  dimensions  of  the  information  problem  for  chemis- 
try and  chemical  engineering  are  considered,  it  is  easy  to  see  why  a 
community  service  is  a  requirement  for  all  those  who  use  scientific 
information.    If  information  processing  is  carried  out  well,  the  infor- 
mation will  be  cheaper  and  more  easily  available.    If  it  is  not  done 
well,  such  operations  will  not  continue  to  exist.    Considering  not  only 
chemistry  and  chemical  engineering  information,  but  also  all  of  the 
other  scientific  and  technical  information  available,  it  is  not  possible 
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for  any  one  institution  to  meet  all  of  its  information  needs  from  with- 
in its  own  resources.    Community  services  such  as  Chemical  Ab- 
stracts are  a  necessity  if  information  is  to  be  used  to  the  best  ad- 
vantage. 

When  the  full  user  community  for  scientific  and  technical 
information  and  the  diversified  needs  of  the  community  from  insti- 
tution to  institution  and  from  individual  to  individual  are  considered, 
it  can  be  seen  that  the  traditional  means  of  publication,  which  as- 
semble information  into  traditional  printed  form,  are  neither  flexible 
enough  nor  responsive  enough  to  meet  today's  needs  adequately.    For 
this  reason,  during  the  last  five  years  the  ACS/ CAS  started  to  shift 
its  operations  to  a  computer  base.    It  is  expected  that  all  CAS  publi- 
cations and  services  will  be  composed  through  the  computer  by  mid- 
1969. 

In  addition  to  providing  acquisition,  analysis,  organization,  and 
indexes  for  all  chemical  and  chemical  engineering  information,  CAS 
intends  to  provide  systems  through  which  the  stored  data  can  be  ade- 
quately tapped  for  routine  use  by  the  scientific  community.    The  shift 
to  a  computer  base  will  provide  the  additional  flexibility  which  is 
necessary  to  be  sure  that  the  information  is  fully  used. 

Several  of  the  CAS  computer  applications  involve  or  affect  the 
operations  of  the  CAS  Library.    This  paper  summarizes  three  of 
these  applications.    First,  CAS  maintains  a  computer-based  inventory 
of  published  abstracts  and  rejected  abstract  manuscripts.    This  is  an 
on-line  operation,  designed  to  produce  management  guidance  tools- 
tools  useful  not  only  to  CAS  executive  management,  but  also  tools  that 
are  very  useful  to  the  CAS  Library,  especially  in  its  acquisitions 
work.    The  second  is  a  computer -produced  directory  of  serial  and 
non- serial  titles  and  their  ASTM  Coden.    The  third  is  a  computer- 
based  project  still  partially  in  the  developmental  stage.    This  is  a 
computer-based  list  of  periodicals  for  chemistry  and  chemical  engi- 
neering.   This  is  an  actual  scheduled  project  with  definite  internal 
applications  and  with  an  output  that  will  be  of  direct  benefit  to  the 
scientific  and  technical  information-handling  community. 

Before  describing  our  computer-based  inventory  system,  the 
framework  for  this  topic  should  be  set  by  referring  to  a  few  statistics 
which  in  themselves  justify  the  system.    In  1966,  CAS  expects  to  pub- 
lish approximately  225,000  abstracts  of  published  papers  and  patents 
in  Chemical  Abstracts.    Of  these,  170,000  will  be  abstracts  of  papers 
and  government  reports  which  originate  from  over  100  countries. 
Approximately  50  percent  of  this  collection  is  published  in  English, 
while  the  remaining  50  percent  appears  in  about  50  other  languages. 
This  material  will  come  from  nearly  11,000  different  serials  and 
from  between  600-700  non-serial  publications,  such  as  the  proceed- 
ings volumes  of  conferences.    It  will  also  announce  the  availability  of 
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approximately  3,000  new  books.    To  look  at  this  another  way,  assume 
that  a  reasonable  estimate  of  the  world's  annual  output  of  scientific 
and  technical  articles  is  one  or  two  million  articles,  and  that  a  rea- 
sonable estimate  of  the  total  number  of  scientific  and  technical  jour- 
nals is  between  35,000  and  50,000.    CAS  then  has  the  problem  of 
abstracting  somewhere  between  11.3  percent  and  22.5  percent  of  the 
world's  annual  output  of  scientific  and  technical  papers  and  monitor- 
ing somewhere  between  22  percent  and  32  percent  of  the  world's 
scientific  and  technical  journals. 

It  is  apparent  then  from  these  statistics,  as  well  as  from  com- 
parable statistics  for  earlier  years  of  CAS  activity,  that  an  inventory 
of  work  accomplished  is  an  important  management  tool.    It  has  been, 
and  still  is,  necessary  for  us  to  have  statistical  data  about  the  number 
of  abstracts  we  publish,  the  length  of  the  abstracts,  and  the  number  of 
abstracts  related  to  various  sub-disciplines  of  chemistry,  in  order 
to  plan  for  future  growth  and  to  establish  staff  and  financial  require- 
ments.   CAS  needs  to  know  the  source  of  its  abstracts  for  abstractor 
remuneration  purposes.    It  needs  to  know  which  primary  journals 
contained  the  papers  that  were  abstracted,  and  the  relative  degree 
of  productivity  of  abstractable  papers  of  these  journals.    CAS  needs 
to  know  how  many  of  the  abstracts  are  prepared  in-house  versus  those 
that  are  prepared  by  voluntary  abstractors,  and  the  distribution  of  the 
abstracts  throughout  Chemical  Abstracts. 

Prior  to  1963,  some  of  these  statistics  were  available  as  the 
result  of  various  hand  counts,  while  others  were  lacking  completely. 
In  1963  a  team  consisting  of  persons  well  versed  in  existing  CAS  pro- 
cedures and  of  systems  design  personnel  devised  a  computer-based 
inventory  system  which  was  capable  of  producing  the  needed  reports 
from  a  single  input  operation.    Although  this  is  an  on-line  operation, 
it  is  not  static.    It  is  undergoing  continual  change.    The  intention  is  to 
convert  from  manual  to  machine -based  input  as  a  part  of  overall 
mechanization  of  CAS  operation.    This  discussion,  however,  will  be 
devoted  to  the  existing  system. 

One  of  the  factors  that  enabled  CAS  to  design  the  present  sys- 
tem was  the  ability  to  represent  the  titles  of  primary  publications  in 
a  highly  compacted  form  by  using  the  American  Society  for  Testing 
and  Materials  Coden.    CAS  first  adopted  the  use  of  Coden  for  its 
computer-based  publication,  Chemical  Titles.    Of  the  40,000  five- 
character  unique  codes  that  ASTM  has  assigned  to  periodical  and  non- 
periodical  titles,  24,600  of  these  were  assigned  at  the  request  of  the 
Chemical  Abstracts  Service.    Coden  have  now  been  assigned  to  all 
titles  listed  in  the  Chemical  Abstracts  List  of  Periodicals  back  to  and 
including  the  1956  edition. 
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The  computer-based  inventory  begins  with  five  documents: 

(1)  the  printed  issues  of  Chemical  Abstracts  in  page  proof  format, 

(2)  manuscripts  submitted  for  publication  in  Chemical  Abstracts  that 
have  been  rejected,  (3)  a  directory  of  periodical  titles  and  their  unique 
Coden,  (4)  the  coding  sheets  for  printed  abstracts,  and  (5)  the  coding 
sheets  for  rejected  abstracts.    The  worksheet  for  printed  abstracts 
was  designed  in  conjunction  with  a  set  of  conventions  which  enables 
clerical  employees  to  record  pertinent  statistical  data  about  printed 
abstracts  at  the  rate  of  700-800  abstracts  per  day  (see  Figure  1). 

The  information  across  the  top  of  the  form  identifies  the  coder,  the 
volume  of  Chemical  Abstracts  (CA),  the  issue,  and  the  section  which 
she  is  coding.    The  form  is  divided  into  four  basic  parts.    Part  1  is 
for  CA  reference  data;  Part  2,  the  original  reference  data;  Part  3, 
the  name  of  the  abstractor  or  the  in-house  abstract  code;  and  Part  4 
is  the  amount  of  space  occupied  in  CA  by  the  abstract  and  related 
matter,  such  as  structures,  cross  references,  and  blank  space.    Under 
the  CA  reference,  space  is  provided  to  record  the  CA  column  number, 
the  location  of  the  abstract  within  the  column  by  a  letter  "a"  through 
"h,"  and  a  more  specific  location  within  the  letter  fraction  by  a 
numeral.    The  language  of  publication  of  the  paper  is  also  indicated. 
If  the  abstract  exceeds  a  certain  length,  this  is  indicated  under  prior 
lines. 

For  the  original  reference  we  use  the  Coden  to  designate  the 
journal,  and  add  the  year,  volume,  issue,  and  beginning  page  data. 
Patent  numbers  are  recorded  in  the  column  for  page  data  while  lengthy 
identification  such  as  report  numbers  are  listed  in  the  space  for  other 
identification.    The  country  in  which  the  work  was  done  and  the  ter- 
minal page  number  are  also  indicated. 

Under  the  next  heading  is  written  the  last  name  and  initials  of 
the  abstractor  or  a  four -character  code  that  specifies  an  abstract 
produced  in-house.    This  four -character  code  indicates  the  source  of 
the  abstract,  for  example,  an  author  abstract,  a  translation  of  an  au- 
thor abstract,  a  copy  abstract,  or  an  abstract  prepared  by  a  CAS  staff 
member.    The  last  column  is  used  to  note  by  number  of  lines  the 
length  of  the  abstract. 

When  coding  for  an  issue  of  CA  has  been  completed,  data  on  the 
completed  worksheets  are  transcribed  directly  onto  magnetic  tape 
using  a  Mohawk  Science  Data  Recorder,  model  1101.    These  data  are 
then  immediately  processable  in  the  computer  to  produce  the  follow- 
ing reports. 

Report  1  is  a  density  report  (see  Figure  2).    As  an  editorial 
check,  it  is  useful  to  follow  closely  the  variation  of  page  density  of 
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abstracts  for  each  of  the  74  sections  of  CA,  that  is,  the  number  of  ab- 
stracts in  each  section  and  the  number  of  pages  required  for  each 
section.    This  variation  gauges  the  amount  of  editorial  effort  devoted 
to  each  section.    These  reports  are  produced  biweekly.    They  show: 

(1)  the  number  of  abstracts  of  papers  and  patents  in  each  section; 

(2)  the  amount  of  space,  in  columns,  devoted  to  abstracts  of  papers 
and  patents  in  each  section;  and  (3)  the  total  number  of  abstracts  and 
columns  for  each  section.    Each  issue  report  also  shows  these  same 
data  cumulated  for  all  issues  of  the  volume  being  inventoried. 

Report  2  is  the  in-house  abstracting  report.    Each  CAS  abstract 
produced  in-house  is  coded  in  CA  with  a  descriptive  four -character 
code.    Statistics  for  abstracts  prepared  by  full  time  CAS  staff  become 
important  as  new  services  such  as  Chemical-Biological  Activities  are 
offered,  and  as  the  use  of  author  abstracts  changes.    This  report  is 
prepared  for  each  volume  of  CA.    It  provides  a  complete  inventory  of 
in-house  abstracts  showing  the  total  number  in  each  issue  (see 
Figure  3),  and  their  distribution  by  section  (see  Figure  4). 

Report  3,  a  three -part  report,  is  designed  to  aid  library  ac- 
quisition work.    It  is  important  for  us  to  know  the  sources  of  the 
papers  that  are  abstracted  and  to  relate  these  data  to  the  acquisition 
and  processing  of  the  journals  throughout  CAS. 

Report  3A  (see  Figure  5)  lists  alphabetically  by  Coden  all  of  the 
primary  and  secondary  journals,  patents,  books,  and  one -shot  publi- 
cations (conference  proceedings,  monographic  collections  of  papers, 
etc.)  represented  by  the  abstracts  in  a  volume  of  CA.    Opposite  each 
Coden  is  the  number  of  abstracts  from  the  publication.    This  report 
enables  CAS  to  determine  a  cost/use  ratio  for  the  documents  it  pur- 
chases. 

Report  3B  (see  Figure  6)  is  arranged  in  descending  sequential 
order  by  the  number  of  abstracts  each  document  produced.    That  is, 
it  begins  with  the  journal  which  contained  the  most  papers  abstracted 
and  ends  with  the  one  which  contained  the  least.    This  report  empha- 
sizes the  more  productive  journals  and  serves  as  a  guide  for  estab- 
lishing candidate  journals  for  express  handling.    For  each  primary 
journal  and  each  one-shot  publication  this  report  shows: 

(1)  cumulative  number  of  primary  journals  and  one-shots, 

(2)  abstracts  per  title, 

(3)  cumulative  number  of  abstracts  of  primary  journals  and 
one -shots, 

(4)  percent  of  total  number  of  abstracts  at  5-6  percent  intervals, 

(5)  cumulative  number  of  primary  journals, 

(6)  cumulative  number  of  abstracts  of  primary  journals, 

(7)  percent  of  total  number  of  abstracts  for  primary  journal 
papers  based  on  5-6  percent  intervals, 

(8)  cumulative  number  of  one-shots, 
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(9)  cumulative  number  of  abstracts  of  one -shots,  and 
(10)  percent  of  total  number  of  abstracts  for  one-shots  based  on 
5-6  percent  intervals. 

The  tapes  used  to  generate  these  volume  reports  are  merged,  and 
listings  of  journals  with  cumulative  productivity  data  are  printed. 
These  cumulative  lists  give  us  a  truer  picture  of  the  journals  which 
are  most  productive,  and  these  are  the  journals  we  handle  on  an  ex- 
press basis.    By  cumulating  data  from  these  reports  for  four  years, 
and  plotting  the  number  of  journals  required  to  produce  any  given 
percentage  of  the  total  number  of  abstracts,  an  interesting  curve  is 
obtained  (see  Figure  7).    The  curve  shows  that  25  percent  of  the  total 
number  of  abstracts  were  obtained  from  50  journals;  50  percent  were 
obtained  from  250  journals;  75  percent  were  obtained  from  1,000 
journals;  while  90  percent  came  from  slightly  over  3,000  journals. 
The  remaining  10  percent  of  the  abstracts  came  from  an  additional 
2,500  journals.    We  monitor  an  additional  8,000  journals  of  which  an 
average  of  2,500  produce  the  final  10  percent  of  the  total  number  of 
abstracts. 

Report  3C  (see  Figure  8).    The  growth  in  many  areas  of  chemis- 
try and  chemical  engineering  is  gauged  by  monitoring  other  secondary 
journals  that  overlap  CA  coverage.    We  currently  monitor  approxi- 
mately 50  such  secondary  publications.    This  report  lists  the  primary 
journals  that  contain  papers  covered  as  copy  abstracts. 

Report  4  is  used  to  generate  the  CA  abstractors'  payroll.    The 
output  is  the  abstractors'  actual  pay  checks.    CAS  pays  its  abstractors 
by  the  line  as  printed  in  CA.    Therefore,  it  is  necessary  to  determine 
for  each  abstract  the  number  of  lines  and  to  record  this  information 
along  with  the  abstractor's  name.    The  payroll  is  produced  on  a 
quarterly  basis. 

Report  5.    Certain  areas  of  chemistry  produce  a  great  many 
index  entries  per  abstract.    For  instance,  papers  dealing  with  chemi- 
cal synthesis  report  a  great  many  compounds.    The  corresponding 
sections  of  CA  are  indexed  directly  from  the  original  journal  article. 
Thus  for  the  indexers  of  these  sections  their  primary  document  needs 
can  be  predetermined.    One  of  the  products  of  the  inventory  is  the 
direct  print- out  of  library  call  slips  (see  Figure  9)  for  those  sections 
which  have  a  high  density  of  index  entries  per  paper.    This  permits 
the  library  to  arrange  the  original  journal  material  in  a  manner  con- 
venient for  the  indexing  effort. 

Report  6  is  a  listing  of  document  titles,  with  the  number  of  ab- 
stracts from  each,  by  subject  based  on  CA's  74  sections.    Either 
single  sections  or  a  combination  of  sections  can  be  listed.    Coverage 
within  each  of  the  individual  sections  of  CA  is  continually  watched . 
In  addition,  each  year  we  select  several  sections  to  investigate  in 
depth.    Here  we  review  with  the  section  editor  those  subdivisions 
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which  he  feels  deserve  more  attention,  and  the  primary  journal 
sources  for  each  subdivision  are  investigated. 

Report  7  is  a  listing  of  references  to  printed  abstracts  arranged 
according  to  bibliographical  citations  by  primary  journals.    It  is  used 
to  check  coverage  of  a  given  journal.    When  merged  with  Report  8, 
which  is  described  below,  it  serves  as  a  useful  tool  to  search  for  holes 
in  the  coverage.    We  intend  to  extend  the  usefulness  of  this  report  by 
inputting  data  about  papers  that  have  been  perused  but  not  selected; 
thus,  any  gaps  in  the  coverage  will  be  very  apparent;  and,  in  fact,  the 
report  will  take  a  negative  approach  to  the  problem,  i.e.,  it  shows  us 
what  has  not  been  entered  into  the  CAS  system. 

Report  8.    Another  aspect  of  the  computer-based  inventory  is 
the  record  of  abstract  manuscripts  that  have  been  rejected.    Each 
year  a  number  of  the  abstracts  submitted  to  us  for  publication  do  not 
meet  the  criteria  for  publication  in  CA.    For  example,  they  do  not  re- 
port chemical  information  or  new  chemical  information.    Others  are 
duplicates  of  abstracts  that  have  already  been  published.    Duplicates 
fall  into  two  classes:    those  papers  which  are  abstracted  twice  by 
error  and  those  papers  that  are  published  in  two  or  more  sources. 
These  rejected  abstracts  are  coded  on  worksheets  (see  Figure  10), 
transcribed  onto  magnetic  tape,  and  reports  are  generated  which  show 
(1)  the  primary  journal  address  of  the  article,  (2)  the  name  of  the  ab- 
stractor, (3)  why  the  paper  was  abstracted,  that  is,  did  the  abstractor 
do  it  voluntarily  or  did  we  ask  him  to  do  it,  and  (4)  why  the  abstract 
was  not  used. 


CAS  Coden  Directory 


A  basic  tool  for  all  of  our  computer-based  work  which  involves 
the  titles  of  journals  and  abstracted  monographs  is  the  CAS  Coden 
Directory.    This  Directory  now  contains  an  alphabetical  listing  of 
serials  and  nonserials  by  full  title,  each  accompanied  by  a  unique 
five -character  Coden  with  a  numerical  sequencing  or  alphabetizing 
number  (see  Figure  11).    The  Directory  has  an  index  arranged  alpha- 
betically by  Coden.    The  data  contained  in  the  Directory  is  all  in 
upper-case  type.    We  are  now  in  the  process  of  writing  programs 
which  will  enable  us  to  revise  the  Directory.    The  new  Coden  Direc- 
tory will  contain  (1)  the  Coden,  and  in  upper  and  lower  case  type 
(2)  the  full  title,  and  (3)  the  American  Standards  Association  title 
abbreviation  (see  Figure  12).    This  Directory  will  also  contain  a  nu- 
merical sequencing  number  and  will  be  accompanied  by  an  index  ar- 
ranged alphabetically  by  Coden.    This  Coden  Directory  constitutes  an 
important  source  of  input  for  the  periodicals  list  as  will  be  explained 
later  on. 
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Periodicals  List 


Before  describing  the  work  that  we  are  doing  on  our  new  List  of 
Periodicals,  the  history  of  this  continuing  project  is  worthy  of  men- 
tion.   CAS  has  always  paid  conscientious  attention  to  nomenclature. 
This  applies  not  only  to  the  names  of  chemical  compounds,  but  also 
to  the  titles  of  chemical  publications.    In  1906  the  American  Chemical 
Society  published  a  list  of  journal  titles  in  the  Proceedings  of  the 
American  Chemical  Society.    It  was  entitled  "A  List  of  Abbreviations" 
and  contained  37  journal  titles  and  the  abbreviated  form  for  each. 
This  list  was  revised,  enlarged,  and  issued  as  a  separate  publication 
under  the  promising  title  List  of  Journals  to  be  Abstracted  by  Chemi- 
cal Abstracts.    It  preceded  by  several  months  the  first  issue  of 
Chemical  Abstracts  and  consisted  of  abbreviated  titles  for  370  jour- 
nals.   In  1908  the  list  was  again  revised  and  published  as  a  part  of  the 
introduction  to  the  second  volume  of  Chemical  Abstracts.    It  contained 
475  abbreviated  titles  and  was  accompanied  by  three  simple  rules  per- 
taining to  journal  title  abbreviation.    In  1910  the  list  appeared  as  the 
List  of  Periodicals,  and  the  form  of  entry  still  in  use  today  was  intro- 
duced.   The  full  title  was  given  with  those  letters  constituting  the 
abbreviation  set  in  bold  face  type.    In  addition  to  the  title  and  abbrevi- 
ation, the  entries  included  frequency,  volume  number,  price,  and  the 
publisher's  address.    The  List  continued  to  be  issued  biennially  until 
1922,  when  a  second  important  feature  was  added— library  source- 
guide  data  for  162  American  libraries.    This  key  to  library'files  gave 
new  value  to  the  List,  for  it  now  became  a  useful  tool  for  locating  ab- 
stracted journals  in  the  libraries  of  the  United  States. 

The  purpose  of  the  List  was  two-fold.    First,  it  was  to  stan- 
dardize the  abbreviated  title  form.  Second,  it  was  to  furnish  biblio- 
graphic data  about  abstracted  papers  by  providing  a  library  source 
guide.    Beginning  in  1936  the  List  was  published  quinquennially.    In 
1957  the  first  annual  supplement  to  the  quinquennial  edition  appeared. 

CAS  has  now  begun  work  on  a  completely  new  edition  of  the  List 
of  Periodicals.    This  new  List  is  being  designed  to  include  much  more 
data  than  did  earlier  editions.    It  is  to  be  made  available  in  both 
printed  and  machine -readable  form  (either  punched  cards  or  magnetic 
tape).    Programs  for  searching  the  tapes  will  also  be  made  available. 

This  catalog  of  periodicals  related  to  chemistry  and  chemical 
engineering  will  bring  into  a  single  source  much  valuable  bibliographic 
data  in  considerable  detail  that  is  now  only  partially  available  and 
from  a  large  number  of  bibliographies,  union  lists,  and  individual 
library  files.    A  typical  entry  in  the  List  will  include  the  full  title  of 
the  publication;  a  translation  of  the  title  if  the  language  is  other  than 
English,  French,  German,  or  Spanish;  the  ASA  Z39  standard  title 
abbreviation;  the  ASTM  Coden;  the  languages  of  publication  and 
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summaries;  the  history  of  the  periodical  with  references  to  any  for- 
mer titles,  current  frequency,  and  volume  number  data;  the  pub- 
lisher's address;  and  the  price.    The  entries  will  also  include  the 
title  cataloged  according  to  American  Library  Association  cataloging 
rules  and  the  library  holdings  information  (note  that  we  will  list  hold- 
ings rather  than  merely  an  indication  that  a  library  has  a  current 
subscription  to  a  journal). 

Initially  we  intended  to  publish  a  1966  edition  of  the  Chemical 
Abstracts  List  of  Periodicals  with  Key  to  Library  Files.    This  List 
would  have  been  merely  an  updated  version  of  the  1961  List.    The 
relationship  between  it  and  the  1961  List  would  have  been  identical  to 
the  relationship  between  the  1961  List  and  the  1956  List,  that  is,  al- 
ready published  information  about  discontinued  periodicals  and  name 
changes  would  not  have  carried  forward  into  the  new  List.    Had  this 
course  been  followed,  the  1966  List  would  have  been  the  14th  CA  List 
of  Periodicals.    To  do  a  literature  search  using  all  of  Chemical  Ab- 
stracts, one  would  have  had  to  use  all  14  editions.    As  an  alternative 
we  investigated  the  possibility  of  publishing  a  60-year  cumulative 
edition  of  the  List  of  Periodicals,  thus  bringing  within  one  set  of 
covers  pertinent  bibliographic  data  about,  and  library  source  guide 
information  for,  the  serials  listed  in  all  previous  editions. 

The  next  logical  extension  of  the  List  was  to  consider  the  in- 
clusion of  data  for  other  serial  publications  related  to  chemistry  and 
chemical  engineering  which  had  not  been  abstracted  by  Chemical  Ab- 
stracts.   A  survey  of  Beilstein's  Handbuch  der  Qrganischen  Chemie 
revealed  some  300  pre-1907  titles.    Coverage  of  the  literature  of 
pure  and  theoretical  chemistry  from  1830  through  1940  by  Chemisches 
Zentralblatt  (CZ)  was  particularly  outstanding.    We  discovered  ap- 
proximately 500  journals  covered  by  CZ  prior  to  1940  had  not  been 
covered  by  CA.    We  also  developed  a  list  of  approximately  one  hundred 
defunct  nineteenth- century  chemical  journals  to  be  included. 

Thus  the  List  which  is  proposed  will  include  approximately 
24,000  entries  for  journals,  and  5,000  entries  for  monographs  ab- 
stracted by  Chemical  Abstracts,  Chemisches  Zentralblatt,  and  covered 
by  Beilstein,  as  well  as  the  nineteenth -century  chemical  journals 
never  covered  by  any  of  these  services.    This  extension  makes  our 
List  truly  a  comprehensive  list  of  periodicals  for  chemistry  and 
chemical  engineering. 

The  procedure  used  to  generate  this  List  may  actually  be  viewed 
as  two  separate  projects:    (1)  gathering  the  pertinent  bibliographic 
data,  and  (2)  obtaining  library  holdings  information  from  some  350- 
400  cooperating  libraries.    Our  first  step  was  to  establish  a  file  of 
the  serial  and  nonserial  titles.    This  was  done  by  cutting  and  pasting 
the  entries  from  all  of  the  Lists  of  Periodicals,  entering  them  into  a 
single  file,  and  adding  to  this  file  titles  for  periodicals  from  Chem- 
isches Zentralblatt,  Beilstein,  and  several  other  sources.    Once  the 
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file  was  established,  the  Coden  was  added  to  each  main  entry.    This 
necessitated  the  acquisition  of  several  hundred  additional  Coden  from 
ASTM  (see  Figure  13). 

The  first  publication  our  work  generates  is  a  checking  edition 
for  the  List  of  Periodicals  (see  Figure  14).    The  checking  edition 
contains  the  titles  cataloged  according  to  ALA  rules,  the  Coden,  the 
history  of  the  serials,  and  a  space  for  the  cooperating  libraries  to 
write  in  the  dates  of  their  consecutive  holdings.    It  is  necessary  to 
furnish  the  cooperating  libraries  with  a  checking  edition  arranged  ac- 
cording to  ALA  cataloging  rules,  since  the  majority  catalog  their 
serials  according  to  these  rules.    The  checking  edition  for  the  CA 
1961  List  was  produced  on  unit  card  equipment.    It  contained  9,000 
entries  in  ALA  form.    Fortunately,  these  cards  had  been  retained. 

A  new  checking  edition  is  now  being  prepared  in  three  parts  for 
use  by  the  cooperating  libraries.    The  punched  card  with  the  ALA 
form  of  entry  is  matched  with  5"  x  8"  cards  from  the  master  file.    If 
no  punched  cards  are  available,  the  ALA  form  of  entry  is  written  on 
the  5"  x  8n  card.    Historical  data  are  added  and  the  cards  passed  to 
keypunchers  to  generate  a  new  set  of  punched  cards  containing  the 
Coden,  the  title  in  ALA  form,  and  the  history.    These  cards  are  inter- 
filed.   When  the  file  is  complete,  they  will  be  converted  to  tape.    The 
resulting  print-out  will  be  used  to  produce  camera- ready  copy  for 
printing  the  checking  editions. 

These  checking  editions  will  be  sent  to  the  cooperating  libraries, 
and  when  each  is  returned,  cards  will  be  punched  showing  the  Coden, 
the  cooperating  library's  code  number,  and  the  library's  consecutive 
holdings  of  the  serial  (see  Figure  15).    These  cards  will  be  read  onto 
tape,  sorted,  and  a  tape  containing  library  holdings  information  will 
be  ready  for  future  use. 

There  will  be  then  one  tape  with  the  full  title  of  a  publication  in 
upper  and  lower  case,  its  official  ASA  Z39  abbreviation,  and  ASTM 
Coden.    On  another  tape  there  will  be  the  ASTM  Coden,  the  ALA  form 
of  entry  for  the  publication,  and  the  history  data. 

The  cards  in  the  master  record  file,  which  contain  Coden  and 
entries  for  the  publications  as  they  appeared  in  a  CA  List  of  Periodi- 
cals, will  be  compared  to  current  issues  of  the  serials  and  nonserials. 
The  entries  will  be  edited,  expanded,  and  updated  (see  Figure  16). 
The  data  on  the  edited  master  record  file  card  will  then  be  keyboard- 
ed  on  a  Dura  Mach  -  II  paper  tape  generating  typewriter  for  later 
conversion  to  magnetic  tape.    The  worksheet  is  being  designed  so 
that  the  Dura  typist  will  not  have  to  type  any  of  the  information  al- 
ready in  machine -readable  form,  i.e.,  she  will  not  have  to  keyboard 
the  title  or  abbreviated  title,  the  history,  or  the  ALA  form  of  entry. 
These  will  all  be  retrievable  by  inputting  the  Coden.    She  must  add 
needed  subtitles,  English  translations,  references  to  the  former 
titles,  languages  of  publication  and  summaries,  the  frequencies  of 
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publication,  the  current  volume  numbers  and  years,  prices,  and  the 
identification  of  the  publishers.    Repetitive  phrases  such  as  "Printed 

in,"  "with summaries,"  etc.,  will  be  stored  in  the  computer  so 

that  these  do  not  have  to  be  keyboarded  but  only  coded. 

Entries  for  discontinued  periodicals,  for  former  titles  of  peri- 
odicals, and  for  congress  and  symposia  proceedings  volumes  will 
contain  those  elements  of  information  that  are  pertinent  to  them. 
For  instance,  entries  for  defunct  titles  will  not  contain  publisher  or 
price  data.    For  congress  and  symposia  proceedings  volumes,  the 
number  of  the  meeting,  if  it  is  one  of  a  series,  the  place  where  it  was 
held,  the  date  of  the  meeting,  the  number  of  volumes  of  papers  pub- 
lished, and  the  editor's  name  will  be  entered. 

After  the  paper  tape  has  been  generated  and  converted  to  mag- 
netic tape,  this  will  be  entered  into  the  computer,  matched  with  the 
data  from  the  Coden  directory  tape,  data  from  the  tape  containing  the 
ALA  form  of  entry,  and  the  history  information.    The  output  will  be  a 
proof  copy  which  will  then  be  checked  against  the  original  edited 
5"  x  8"  card.    If  any  corrections  need  to  be  made,  the  proof  copy  will 
be  edited  and  a  corrected  paper  tape  generated  and  recycled.    The 
result  of  this  work  will  be  the  bibliographic  data  tape.    This  tape  will 
be  merged  with  the  library  holdings  tape  and  a  master  record  tape 
generated  from  which  a  print-out  will  be  produced  for  camera-ready 
copy  for  the  printed  version  of  the  periodicals  list. 

From  this  master  record  tape,  it  will  be  possible  to  generate 
special  listings  of  the  journals  by  specified  characteristics  such  as 
country  of  origin,  language,  price,  etc.,  or  by  any  combination  of  the 
elements  which  constitute  the  complete  entry.    Library  holdings  and 
union  lists  will  also  be  obtainable  (see  Figure  17).    The  present  pro- 
duction schedule  calls  for  the  completion  of  the  Comprehensive  List 
of  Periodicals  by  the  end  of  October  1968. 


Chemical  Titles 


The  CAS  computer-based  system  really  began  in  1961  with  the 
publication  of  Chemical  Titles  (CT),  the  first  production- scale  appli- 
cation in  the  world  of  computers  to  produce  an  index  to  technical 
papers.    CT  was  the  forerunner  of  a  whole  new  generation  of  mecha- 
nized services.    CT  is  a  biweekly,  computer-based  publication  which 
covers  titles  of  chemical  and  chemical  engineering  interest  taken 
from  691  journals.    Approximately  90,000  titles  were  published  in  CT 
in  1965.    All  articles  covered  in  CT  are  also  covered  in  CA.    The 
majority  of  the  journals  covered  is  received  at  CAS  in  advance 
form  and  thus  the  coverage  is  quite  prompt.    The  titles  are  selected 
by  the  professional  staff  at  CAS,  translated,  and  then  edited.    Editing 
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of  the  titles  includes  segmentation  of  words  to  facilitate  scanning  of 
the  indexes  by  users.    The  article  titles,  Coden  (a  five-letter  code  for 
the  journal  title)  references,  and  authors  are  keypunched  into  cards, 
and  the  information  then  transferred  to  magnetic  tape  for  computer 
processing.    Check  lists  (corresponding  to  galley  proofs  in  standard 
publication  procedures)  are  printed  by  the  computer,  and  the  lists  are 
proofed  to  insure  the  accuracy  of  the  keypunched  data. 

Each  two  weeks  the  input  data  is  processed  by  the  computer  and 
sorted  into  the  three  parts  of  CT:    the  Keyword -in -Context  (KWIC) 
Index,  the  Bibliography  Section  (tables  of  contents),  and  the  Author 
Index  for  the  issue.    The  information  is  printed  by  the  computer  on 
graphic  arts  quality  paper  and  is  composed  into  pages  mounted  on 
special  cardboard  backing.    The  bold  face  journal  titles  are  prepared 
on  a  Varityper  Headliner  and  pasted  on  the  pages.    The  camera-ready 
copy  is  then  sent  to  the  printer,  where  it  is  photographically  reduced 
and  printed  on  an  offset  press. 

The  KWIC  index  is  an  alphabetical  arrangement  of  keywords 
selected  from  the  article  titles  (see  Figure  18).    The  keyword  ap- 
pears in  the  center  of  the  page  with  its  associated  words  preceding 
and  following  it  in  the  same  manner  as  they  do  in  the  title  itself. 
Since  the  number  of  words  surrounding  the  indexed  keyword  is  limited 
by  the  breadth  of  the  page,  words  or  phrases  are  sometimes  truncat- 
ed.   If  the  keyword  appears  near  the  beginning  or  ending  of  a  title,  the 
title  will  "wrap  around"  to  fill  the  line  space.    A  "wrap  around"  is 
indicated  by  a  small  cross  in  the  text.    Each  keyword  index  entry 
carries  a  reference  to  the  Coden,  volume  (or  issue),  and  page  of  the 
journal  in  which  the  title  can  be  found  (and  by  which  entries  are  listed 
in  the  bibliography  section).    The  KWIC  index  provides  an  average  of 
six  entries  to  each  title.    The  bibliography  section  lists  the  journals 
arranged  alphabetically  by  Coden  (see  Figure  19).    For  each  Coden, 
the  journal  title,  volume,  and  issue  are  given.    The  titles  taken  from 
each  journal  are  grouped  together.    The  author  index  lists  all  authors 
alphabetically  (see  Figure  20).    Each  index  entry  provides  a  reference 
to  the  bibliography  section. 

CT  is  thus  arranged  to  permit  the  location  of  information  of 
interest  via  keywords,  authors,  or  journals.    It  is  apparent  that  the 
magnetic  tapes  from  which  the  issues  of  CT  are  produced  can  be  use- 
ful in  special  subject  searches.    During  1964,  several  cooperative  ex- 
periments were  conducted  involving  CT  tapes.    In  one  of  the  experi- 
ments, individual  scientists  at  Eli  Lilly  and  Company  provided  search 
terms  covering  their  individual  interests.    Copies  of  the  magnetic 
tapes,  corresponding  to  each  CT  issue,  were  delivered  to  the  company 
and  computer  searches  were  performed.    Search  results  in  the  form 
of  individualized  listings  of  papers  were  given  to  the  participating 
scientists.    Each  individual  was  allowed  to  change  his  set  of  search 
terms  as  his  interest  became  better  defined  or  as  it  changed.    In 
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another  experiment,  searches  were  performed  at  CAS  for  the  Olin- 
Mathieson  Corporation.    In  this  program  the  set  of  search  terms 
represented  the  interests  of  a  research  group. 

The  highly  successful  results  of  these  experiments  and  the  sub- 
stantial interest  shown  by  other  organizations  provided  the  basis  for 
the  introduction  in  1965  of  subscriptions  to  CT  tapes  and  searches. 
A  subscriber  may  receive  copies  of  the  magnetic  tapes  or  he  may 
have  his  searches  run  by  CAS  in  Columbus,  Ohio,  with  the  search  re- 
sults supplied  in  the  form  of  a  computer  print-out.    The  tapes  and 
searches  are  provided  at  the  time  that  CT  copy  is  sent  to  the  printer 
—about  a  week  ahead  of  the  appearance  of  the  CT  issues.    Along  with 
the  tapes  are  provided  search  programs  written  for  an  IBM  1401  or 
1410.    These  programs  are  now  being  rewritten  for  the  IBM  System 
360  computers  and  are  expected  to  be  available  by  May,  1966.    The 
program  can  handle  "and,"  "or,"  and  "not"  logic.    It  is  also  possible 
to  specify  word  fragments  as  search  terms.    These  aspects  of  the 
program  will  be  explained  in  detail  later.    The  search  product  can  be 
any  or  all  of  the  following:    a  bibliography  of  the  selected  titles,  a 
KWIC  index  of  the  selected  titles,  or  an  author  index  based  on  the 
selected  titles. 

As  part  of  the  production  of  CT,  a  Coden-volume  checking 
feature  is  incorporated.    The  check  is  based  upon  sets  of  valid  Coden 
and  Coden-volume  relationships  stored  in  the  computer.    Each  time 
that  information  is  input  to  the  computer,  it  is  tied  to  the  appropriate 
bibliographic  data.    A  computer  check  is  made  to  determine  the  valid- 
ity of  these  data  and  a  list  of  invalid  relationships  is  printed  with  each 
check  list.    In  this  manner  errors  can  be  detected  prior  to  the  pro- 
duction of  camera-ready  copy. 


Chemical-Biological  Activities 


It  has  been  said  and  written  many  times  in  recent  years  that  the 
sheer  bulk  of  primary  literature  being  published  makes  it  almost  im- 
possible to  keep  abreast  of  new  information  in  a  given  field.    This  is 
particularly  true  in  the  biochemical  fields.    The  number  of  abstracts 
published  in  the  biochemical  sections  of  Chemical  Abstracts  in  1965 
is  estimated  at  63,000.    This  is  32  percent  of  the  approximately 
195,000  abstracts  published  for  ^11  fields.    The  figure  is  based  on 
coverage  of  approximately  11,000  journals,  in  50  languages,  from  100 
countries. 

When  the  magnitude  of  this  amount  of  information  is  considered, 
it  becomes  readily  apparent  that  specialized  alerting  and  retrieval 
services  are  needed.    To  help  meet  this  need,  CAS  introduced 
Chemical-Biological  Activities  (CBAC)  in  January  1965  to  provide  a 
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rapid  survey  of  literature  dealing  with  the  biological  activity  of  or- 
ganic chemical  compounds.    CBAC  is  a  biweekly,  computer-based 
publication  issued  in  two  volumes  annually. 

In  the  development  of  CBAC,  a  number  of  features  were  recog- 
nized as  being  desirable  for  a  cur  rent -awareness  publication  in  the 
field  of  chemical-biological  activity.    Among  these  are:    promptness; 
adequate  coverage  of  pertinent  journals;  information  on  the  journal 
sources,  including  names  and  addresses  of  the  authors;  a  format  that 
combines  legibility  with  ease  of  scanning;  brief  but  adequate  digests 
of  the  work  reported  (including  descriptions  of  the  substances  em- 
ployed, structural  formulas,  conditions  under  which  the  observations 
were  made,  and  an  account  of  the  results  obtained);  and  finally  a  set 
of  indexes  and  a  method  of  storing  data  such  that  the  indexes  can  be 
cumulated. 

The  developmental  work  on  CBAC,  including  the  original  con- 
cepts, the  systems  development,  and  the  necessary  computer  pro- 
gramming, was  done  by  the  CAS  Research  and  Development  Division, 
and  this  division  continues  to  assist  as  the  system  is  modified  and 
improved. 

CBAC  is  a  service  which  combines  the  values  of  an  alerting 
tool,  through  printed  abstracts  and  indexes  of  new  developments  in 
biochemistry,  with  computer-based  storage  and  retrieval  of  infor- 
mation.   As  such,  CBAC's  function  is  broader  than  that  of  either  a 
repository  for  information  or  an  abstract  publication  alone.    In  the 
latter  capacity,  as  has  been  mentioned,  it  is  a  biweekly  publication. 
Its  abstracts  are  known  as  "digests"  to  distinguish  them  from  those 
published  in  Chemical  Abstracts.    Since  a  considerable  portion  of  in- 
coming material  earmarked  for  CBAC  is  received  by  the  Library  in 
advance  form,  the  coverage  is  quite  prompt. 

Considered  on  a  broader  scale,  CBAC  is  not  only  a  self- 
sustaining  entity  but  is  an  integral  part  of  a  larger  system  for  spe- 
cific chemical  information  retrieval,  the  CAS  Chemical  Compound 
Registry.    The  Registry  is  based  on  the  assignment  of  individual  num- 
bers (Registry  Numbers)  to  each  organic  compound  of  known  structure 
which  enters  the  system.    The  Registry  Number  assigned  to  a  com- 
pound of  known  structure  identifies  it  completely— different  ionic 
forms,  isomers,  and  compounds  containing  unusual  isotopic  masses 
are  given  unique  numbers.    Use  of  the  numbers  provides  a  specific 
method  of  storing  and  retrieving  information  on  any  registered  com- 
pound within  the  CAS  system.    CBAC,  then,  is  a  tool  whose  purpose 
is  to  make  the  literature  of  its  defined  fields  more  readily  available 
to  the  scientist,  both  immediately  and  subsequently. 

CBAC  serves  a  field  in  which  there  is  a  great  deal  of  industrial, 
academic,  and  governmental  interest.    At  present,  digests  are  pre- 
pared from  articles  taken  from  350  English  journals,  70  German, 
38  Russian,  35  French,  18  Italian,  17  Japanese  and  20  journals  in  7 
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other  languages.    These  primary  journals  are  examined  as  rapidly  as 
they  are  received  at  Chemical  Abstracts  Service. 

To  be  defined  as  suitable  for  coverage  in  CBAC,  a  scientific 
paper  must  describe  original  work  in  one  or  more  of  the  following 
areas:    first,  the  effect  (or  effects)  of  organic  compounds  on  the 
physiology  or  biochemistry  of  systems  in  or  derived  from  plants, 
animals,  or  micro-organisms;  second,  metabolism  of  endogenous  or 
exogenous  organic  compounds  by  the  organisms  just  enumerated,  or 
by  systems  derived  therefrom,  and  third,  in  vitro  chemical  reactions, 
in  the  absence  of  a  biological  system,  between  organic  compounds  of 
biochemical  interest. 

A  more  detailed  discussion  of  the  criteria  used  in  selecting 
articles  for  CBAC  is  appropriate.    The  first  category,  the  effect  of 
exogenous  organic  compounds  on  organisms  or  systems,  probably  ap- 
plies to  the  largest  number  of  papers  selected  for  CBAC.    This  covers 
portions  of  the  work  done  in  almost  every  field  of  biochemistry,  for 
example,  enzymology,  endocrinology,  protection  against  irradiation, 
microbial  biochemistry,  mammalian  and  non-mammalian  biochemis- 
try, plant  pathology,  immunochemistry  toxicology,  pesticide  and 
herbicide  studies,  and  pharmacology. 

The  second  category,  dealing  with  metabolism,  includes  work 
in  much  the  same  areas  mentioned,  with  the  difference  that  here  the 
biological  organism  or  system  is  exerting  an  effect  on  a  compound 
rather  than  vice  versa.    Since  this  action  is  usually  described  or 
characterized  by  alterations  in  chemical  structure,  this  type  of  work 
is  generally  more  chemical  and  less  physiological  in  nature  than  the 
first  category. 

The  third  category,  concerning  in  vitro  reactions  between  or- 
ganic compounds  of  biochemical  interest,  deals  with  those  compounds 
not  directly  originating  from  a  biological  system  in  the  particular  ex- 
periment.   It  includes,  for  example,  studies  carried  out  using  com- 
mercial enzymes  in  synthetic  systems,  as  contrasted  with  the  use  of 
directly  biological  enzyme  systems  contained  in  homogenates  or 
metachondria.    Examples  of  other  types  included  in  this  category  are 
chemical  treatment  of  proteins  to  elucidate  their  structures  (when 
some  biological  activity  is  implied)  or  rupture  of  nucleotide  bonds  by 
chemical  treatment  in  vitro,  which  is  of  interest  in  the  mechanisms 
of  mutations. 

It  need  scarcely  be  pointed  out  that  the  interest  of  a  given  paper 
may  overlap  two  or  more  of  the  three  categories.    For  example,  work 
reporting  the  effects  of  an  antigen  on  a  host,  plus  the  fate  of  the  anti- 
gen, would  overlap  two  categories.    This  has  no  bearing  on  the  way  in 
which  information  is  abstracted;  all  chemical-biological  information 
is  treated  in  context.    As  a  result  of  the  biological  nature  of  the 
studies,  the  vast  majority  of  the  compounds  encountered  are  organic. 
However,  when  pertinent,  elements  or  inorganic  compounds  are  dealt 
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with  in  the  text.    For  example,  excretion  of  potassium  or  other  inor- 
ganic substances  under  the  influence  of  a  hormone  or  diuretic  would 
be  covered. 

CBAC  is  divided  into  four  sections,  viz.,  the  digest  section  and 
three  indexes— the  key  word -in -context  (or  KWIC)  index,  the  molecular 
formula  index,  and  the  author  index.    The  digest  section  contains  the 
core  information  (see  Figure  21).    The  digests  are  printed  in  such  a 
way  as  to  permit  rapid  scanning  for  subjects  of  interest  to  the  reader. 
The  names  of  organic  compounds  or  classes  of  compounds  appear  at 
the  left  margin  of  the  text  lines  so  that  their  names  may  be  read  by 
moving  the  eye  down  the  page.    To  the  immediate  left  of  the  name  for 
each  fully  defined  compound,  its  unique  Registry  Number  appears. 
The  Registry  Number  appears  only  once  for  each  compound  within  a 
given  digest. 

In  addition,  each  digest  contains  some  words  which  are  high- 
lighted by  being  printed  entirely  in  capital  letters.    These  are  terms 
which,  in  addition  to  the  organic  names,  are  felt  to  be  of  interest  in 
rapid  scanning.    They  include,  for  example,  names  of  elements  and 
inorganic  compounds  (as  these  do  not  necessarily  appear  at  the  left 
margin),  diseases  and  other  pathological  conditions,  names  of  micro- 
organisms (including  viruses),  names  of  organs  or  other  anatomical 
compounds  (mitochondria,  liver),  and  words  which  indicate  a  function 
of  interest  in  the  digest  (for  example,  oxidation  or  diuretic  effect). 

As  an  additional  aid  in  scanning,  structural  formulas  are  given 
in  the  digests.    These  are  selected  by  the  chemists  who  edit  the  di- 
gests.   The  selection  is  based  on  providing  a  clue  to  the  important 
type  (or  types)  of  organic  compounds  dealt  with  in  the  digests.    For 
example,  if  a  number  of  phenol  analogs  are  discussed  in  a  digest, 
only  one  example  of  a  phenolic  structure  would  be  selected  for  illus- 
tration.   A  "large"  structural  formula  is  given  only  once  in  an  issue 
and  subsequent  occurrences  of  the  compound  represented  are  cross- 
referred  to  the  first  appearance  within  the  issue. 

All  papers  selected  from  a  given  journal  are  printed  under  a 
bold  face  group  heading  which  gives  the  journal  name,  volume,  issue 
number,  and  year.    Journals  are  listed  alphabetically  by  Coden,  rather 
than  by  full  title.    Each  digest  is  headed  by  the  title  of  the  original 
paper  (if  in  a  language  other  than  English,  the  title  is  translated). 
Following  the  title  appears  the  complete  bibliographic  reference,  in- 
cluding Coden  notation,  volume  number,  issue  number,  year,  and  in- 
clusive page  numbers.    Names  of  authors  are  then  given,  by  last  name 
and  initials,  in  the  order  given  in  the  journal.    Finally,  the  research 
site  is  given,  when  known. 

All  digests  within  a  volume  of  CBAC  (13  issues)  are  numbered 
consecutively.    Each  sentence  in  the  digest  is  separated  from  the  pre- 
ceding sentence  by  a  blank  space.    All  sentences  except  the  first  are 
given  subnumbers.    The  numbers  and  subnumbers  are  used  in  the 
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KWIC  and  molecular  formula  indexes  to  give  a  precise  reference  to 
that  portion  of  the  digest  where  the  indexed  entry  appears.    Keywords 
are  selected  by  the  chemist  as  he  prepares  the  digest.    As  its  full 
name  implies,  the  KWIC  index  (see  Figure  22)  presents  keywords  in 
the  context  of  the  sentence  within  which  they  appear.    Thus,  the  user 
is  not  limited  to  the  information  contained  in  the  single  indexed  word 
("actinomycin,"  for  example,  may  be  indexed  several  times)  but  he 
can,  within  the  limits  of  a  phrase,  obtain  an  idea  of  the  context  in 
which  the  word  is  being  used.   The  indexed  entity  appears  in  all- 
capital  type  in  the  center  of  the  page,  with  its  associated  words  pre- 
ceding and  following  it  in  the  same  manner  as  they  do  in  the  digest  it- 
self.   All  index  entries  refer  the  user  to  the  digest  number  or  sub- 
number  from  which  they  were  generated.    The  KWIC  index  provides 
an  average  of  30  entries  to  each  digest. 

The  molecular  formula  index  presents  an  ordered  listing  of  the 
molecular  composition  of  organic  compounds  to  which  Registry  Num- 
bers have  been  assigned  (see  Figure  23).    To  the  right  of  the  formula 
appears  the  compound's  Registry  Number;  to  the  left  the  number  (or 
numbers)  of  the  digest  (or  digests)  in  which  the  compound  is  men- 
tioned.   The  reference  is  given  to  the  digest  number  (or  subnumbers) 
in  which  the  compound  first  appears;  subsequent  listings  of  the  com- 
pound within  the  same  digest  are  not  indexed.    The  molecular  formula 
and  Registry  Number  do  not  appear  opposite  each  digest  reference, 
but  only  opposite  the  first  reference;  subsequent  references  are  op- 
posite a  blank  space.    Molecular  formulas  are  listed  in  order  of  in- 
creasing numbers  of  carbon  atoms,  and  each  formula  is  arranged 
according  to  a  modified  Hill  system— carbon  first,  hydrogen  second, 
and  then  other  elements  alphabetically  by  atomic  symbols.    (Note  that 
in  this  system,  CC14,  which  has  no  H  atoms,  will  be  listed  before 
CHCLj  or  pure  hydrocarbons  such  as  C2H4.)    However,  the  molecular 
formulas  of  metallic  salts,  amine  salts,  and  addition  compounds  are 
listed  immediately  following  the  molecular  formulas  of  the  parent 
substances.    This  is  done  in  order  to  eliminate  the  separation  that 
would  otherwise  occur  between  parent  and  derivative.    Thus,  follow- 
ing the  molecular  formula  of  a  carboxylic  acid,  references  to  its 
sodium  salt  would  appear.    Even  though  no  actual  reference  to  the 
parent  compound  may  appear  in  the  given  issue,  its  molecular  for- 
mula will  be  given  in  the  index  to  facilitate  finding  the  molecular  for- 
mulas of  derivatives  which  are  referenced. 

Finally,  the  author  index  presents  an  alphabetical  listing  of  all 
author  names  in  the  issue,  each  name  being  matched  with  the  corre- 
sponding digest  number  (or  numbers).  The  three  indexes  are  cumu- 
lated and  issued  every  six  months  (corresponding  to  volume  indexes). 

The  steps  involved  in  producing  CBAC  are  briefly  as  follows 
(see  Figure  24).    Journals  which  are  covered  in  CBAC  are  scanned  in 
the  Assignment  Department.    A  biochemist  reviews  each  article  in 
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depth  to  determine  if  it  deals  with  one  or  more  of  the  subject  criteria 
used  in  the  selection  of  CBAC  oriented  material.    Digests  of  the  arti- 
cles selected  are  prepared  by  chemists  at  CAS.    All  articles  covered 
by  CBAC  are  also  covered  in  CA.    The  digests,  which  are  on  standard 
forms,  are  sent  to  our  Formula  Indexing  Department  where  registry 
sheets  (containing  structural  formulas)  are  prepared  for  each  of  the 
organic  compounds  with  defined  structures.    The  digest  information  is 
then  keypunched  into  cards  and  entered  on  magnetic  tape  by  means  of 
the  computer  system.    Paralleling  the  processing  of  digests,  the 
registry  sheet  information  is  being  processed  in  the  CAS  Registry 
System,  where  Registry  Numbers  are  assigned  to  the  compounds  by 
the  computer.    These  numbers  are  stored  on  tape  for  merging  with 
the  corresponding  compounds  in  the  text  of  the  digest.    Computer 
worksheets  for  each  of  the  digests  are  edited  and  any  corrections  re- 
entered  onto  the  computer  tape.    The  information  on  tape  pertaining 
to  the  digest  section  is  rearranged  by  the  computer  to  provide  the 
KWIC  index,  the  molecular  formula  index,  and  the  author  index. 
Pages  of  the  digest  section  and  the  indexes  printed  by  the  computer 
are  pasted  on  cardboard  sheets  and  photographed.    The  film  is  used 
to  prepare  plates  for  offset  printing  of  the  CBAC  issues. 

CBAC  is  constantly  being  evaluated  for  possible  improvements. 
A  case  in  point  is  an  additional  index  included  in  the  cumulated  in- 
dexes of  Volume  2  (end  of  1965).    This  is  a  Registry  Number  and 
Registry  Number  Cross-Reference  Index  that  will  not  only  permit  a 
search  of  the  digests  for  individual  compounds,  but  also  for  com- 
pounds whose  structural  features  are  similar.    For  example,  an  amine 
will  be  cross-referred  to  its  acid  salts  by  Registry  Numbers  and  vice 
versa. 

Starting  in  January  of  1966,  CAS  began  to  provide,  on  a  sub- 
scription basis,  the  magnetic  tape  records  of  CBAC,  to  be  searched 
at  the  subscriber's  location  or  at  CAS.    The  tapes  include  all  of  the 
information  contained  in  the  digest  section,  as  well  as  the  molecular 
formulas  of  the  registered  compounds.    This  permits  searching  of 
the  tapes  for  chemical  compounds  by  names  and  by  Registry  Numbers 
and  also  searching  for  syllables,  suffixes,  prefixes,  words,  or  phrases 
according  to  a  given  interest  profile.    Thus,  it  will  be  possible  to  con- 
duct a  search  for  chemical-biological  activity  correlations.    The  out- 
put of  the  search  will  be  bibliographic  references  (CBAC  digest  num- 
ber, original  journal  by  Coden,  volume,  issue,  and  page,  as  well  as 
titles  and  authors.)   A  KWIC  index  of  the  titles  cited  or  a  list  of  the 
authors  will  also  be  available,  if  desired. 

One  of  the  features  that  will  permit  great  flexibility  in  searching 
for  compounds  is  the  ability  to  use  Registry  Numbers  as  search 
terms.    This  method  will  permit  location  of  registered  compounds  in 
digests,  regardless  of  the  chemical  names  used  to  describe  the  com- 
pounds.   The  search  programs  permit  "and,"  "or,"  and  "not"  logic. 
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Weighting  factors  are  available  in  the  search  programs  to  increase 
the  relevancy  of  the  data  recovered. 

The  present  search  programs  are  designed  for  use  with  an  IBM 
1401  or  compatible  computer.    However,  the  search  is  being  repro- 
grammed  in  a  higher  language,  and  it  is  expected  that  it  will  be  com- 
plete by  May  1966.    The  new  programs  will  be  oriented  toward  the 
IBM  system  360  family  of  computers  and  should  permit  reprogram- 
ming  to  utilize  other  computer  configurations.    The  same  Coden  and 
Coden-volume  computer  checks  as  discussed  in  conjunction  with  CT 
are  applied  during  the  production  of  CBAC. 


CAS  Chemical  Compound  Registry 


It  was  mentioned  previously,  in  the  discussion  of  CBAC,  that 
Registry  Numbers  provide  a  specific  method  of  storing  and  retrieving 
information  on  registered  compounds  within  the  CAS  system.    The 
Chemical  Compound  Registry  is  the  heart  of  the  CAS  computer-based 
system.    The  Registry  files  must  contain  a  detailed  description  of 
each  compound.    At  the  minimum,  the  computer  record  of  each  com- 
pound includes  the  full  detail  of  the  conventional  structural  diagram. 
Chemical  nomenclature  does  not  form  an  adequate  basis  for  the  com- 
puter record.    The  Registry  file  also  supplies  the  basis  for  sub- 
structure searches— searches  for  fragments  of  molecules.    Again, 
nomenclature  does  not  provide  enough  details  for  direct  substructure 
searching  in  depth.    It  is  important  that  the  computer  files  include  no 
undetected  synonomy,  which  if  uncontrolled  would  increase  the  files 
to  unmanageable  proportions. 

A  tabular  description  of  the  structural  diagram  of  compounds 
is  used  in  the  Registry.    The  description,  which  is  called  a  connection 
table,  lists  each  of  the  atoms  in  the  molecule  atom-by-atom  and  bond- 
by-bond.    Programs  have  been  developed  to  manipulate  this  tabular 
description  so  as  to  convert  the  table  to  a  unique,  unambiguous  form. 
The  form  is  not  quickly  intelligible  to  the  chemist,  but  it  is  easily 
handled  by  the  computing  system.    Thus,  the  Registry  file  of  struc- 
tures consists  of  an  ordered  list  of  unique,  unambiguous  descriptions 
of  the  individual  compounds.    The  registration  process  amounts  to  an 
ordering  and  merging  by  computer.    The  purpose  is  to  determine 
whether  a  particular  structure  has  already  been  stored  in  the  system. 
If  a  compound  has  been  registered  previously,  the  same  Registry 
Number  is  supplied  as  output;  but  if  a  compound  has  not  been 
registered,  a  new  Registry  Number  is  provided. 
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Future  Computer -Based  Operations 


CAS  has  plans  in  preliminary  design  state  for  several  automated 
library  systems  capable  of  handling  a  large  input  of  documents  on  a 
very  short  time -span  basis.    This  effort  is  but  a  part  of  the  overall 
work  being  done  to  put  CAS  on  a  computer  base. 

The  master  record  tape  produced  for  the  publication  of  the 
Comprehensive  List  of  Periodicals  will  serve  as  the  basis  for  auto- 
mating many  of  our  existing  library  systems  (see  Figure  25).    Exist- 
ing serials  inventory  files,  source  files,  disposition  data  files,  and 
coverage  data  files  are  to  be  converted  into  machine -readable  form 
and  these  data  are  to  be  organized  in  disk  storage.    Once  the  data  are 
in  machine -readable  form,  it  will  be  possible  to  make  many  uses  of 
them.    The  library  applications  include  automated  serials  check-in 
and  rapid  retrieval  of  routing  or  disposition  information.    At  CAS  the 
latter  is  extremely  important.    Misrouted  serials  can  result  in  dupli- 
cation of  abstracting  or  in  papers  not  being  abstracted  at  all.    It  now 
takes  between  six  and  nine  months  to  train  fully  a  clerical  employee 
to  record  and  route  incoming  new  materials  with  minimum  super- 
vision.   Under  our  new  system  this  training  period  can  be  reduced  by 
the  length  of  time  it  takes  to  train  the  clerks  to  inquire  of  the  com- 
puter what  it  is  that  they  are  supposed  to  do  with  the  issues  at  hand. 

We  will  be  able  to  provide  printed  catalogs  of  the  serial  hold- 
ings and  receipts  with  coverage  data.    We  also  intend  to  reproduce 
our  existing  dictionary  card  catalog  in  machine -readable  form  and  to 
produce  printed  book  catalogs.    This  process  will  also  generate  the 
book  announcements  for  publication  in  Chemical  Abstracts. 
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IN  HOUSE  ABSTRACTORS  BY  CODE  FOR  VOL  63  02/21/66  PAGE   01 


CODE  NUMBER  OF NUMBER- -QE- 

ABSTRACTS  PATENTS 


BBJN  350 

BCJD  _   1 

BCJG  1 

BCJN  263 

BCPC  1 

BDJF  3 

BDJL  12 

BOJN 

BDPF 

BOPL 

BDPN 

BFJN 

BGJF 

BGJW 

BGJN 

BGJR 

BGJV 

BGPV 

BHJN 

BKJG 

BKJN 

BLJB 

BLJC 

BLJD 

BLJF 

BLJH 

BLJM 

BLJN 

BLJR 

BLJS 

BLJT 

BLJV 

BLJN 

BLTJ 

BMJG 

BMJR 

BMJW 

BMJX 

BMRJ 

BNJR - 

BPJF 
BPJN 
BPPF 
BSJG 
BSJN 
BSPG 
BTJF 
BTJG 
BTJH 
BTJL 
BTJN 


902 
30 

902 
30 

88 
323 

88 

51 
1 

721 
1 

A 
1 

1 

166 
153 

195 
75 

6 
66 

5 
1 

1 

s 

856 
1 

1 
1 

21 
1 

2 

24f» 

18 

1 

1 
1 

15 
91 

3 

8 

3 
2 

36 
*7 

37 

103 
40 

1 
«; 

16 

Figure  3 
Example  of  Computer  Printout  Inventory  of  CA  Abstracts  by  In-House  Abstractor 
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IN   HOUSE    ABSTRACTORS    BY   CODE_BY_SECT.10N_VOL   63 02/2L/66_PJkGE 01 


CODE 

GLJR 
RCFQ 
RCTT 
RCVO 
RCVQ 

NUMBER  OF                    NUMBER  OF 
ABSTRACTS                       PATENTS 

SECTION   1 

1 

2 
5 

1 

1 

SFTT 
SGTT 
SHTT 
SNOC 

1 
5 

1 
1 

SNTT 
SXTT 

52 
1 

SZTT 

TCNG 

2 
1 

TCSL 
VNJZ 

SECTION  TOTAL 

1 
7 

82 

SECTION   2 

BOPN 
BGJV 

1                           1 
1 

BLJB 
BLJC 

2 
2 

BLJO 
BLJR 

8LJS 
BLTJ 

2 
10 

1 
1 

BMJR 
BSPG 

23 
1                           1 

BTJQ 
BWJY 

2 
1 

BYJY 
GBJR 

53 
6 

GHPF 
GKJY 

I                           1 
2 

GLJR 
GHJR 

13 
3 

GPJR 
GPJW 

3 
3 

GRJF 
GRJG 

1 
2 

GZJR 
HSJR  _ 

7 
1 

MCCR 
MCRK 

1 
1 

Figure  4 

Example  of  Inventory  of  CA  Abstracts  by  In-House  Abstractors,  Arranged  by 

Section  of  CA 
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CODEN 

AAAH 
AAAM 
AABC 
AABF 
AABI 
AABN 
AABQ 
*AABS 
AACH 
AACI 
AACR 
AAFA 
AAFB 
AAFP 
AAFR 
AAGE 
AAGP 
AAGS 
AAGW 
AAHO 
AAKG 
AALS 
AALT 
AAML 
AAMM 
AANE 
AANI 
AANL 
AAOS 
AAPL 
AAQA 
AARE 
AASC 
A  ASH 
AATF 
AATR 
AATS 
AAUP 
ABAB 
ABAH 
ABAI 
ABBI 
ABBS 
ABCH 
ABCL 
ABCO 
ABCZ 
ABEB 
ABEG 
ABEM 


CHEMICAL  ABSTRACTS  SERVICE 


CODEN  BASED  INVENTORY  REPORT   NUMBER  1 
SOURCE  OF  ABSTRACTS 


NUMBER  OF  ABSTRACTS 


CA  VOLUME  63 

PAGE    1 


9 
1 
8 
2 

33 
9 
7 
3 

79 
7 
6 
3 
5 

13 
3 

18 
4 
5 
9 
1 
1 
1 
2 

38 

11 
6 
1 

41 
1 
2 

33 
2 
3 
1 
9 
1 
3 
1 

19 
9 
1 

184 

3 

102 

18 
6 
1 
3 
5 
4 


7  BY  CA      1   BY  OTHER  ABSTRACT  JOURNALS 
29  BY  CA      4   BY  OTHER  ABSTRACT  JOURNALS 

1  CREDITED  TO  RUSSIAN  JOURNALS 


1  BY  CA 


2   BY  OTHER  ABSTRACT  JOURNALS 


NONE  BY  CA 


37  BY  CA 


1  BY  CA 
1  BY  CA 


4   BY  OTHER  ABSTRACT  JOURNALS 


8   BY  OTHER  ABSTRACT  JOURNALS 
2   BY  OTHER  ABSTRACT  JOURNALS 


100  BY  CA      2   BY  OTHER  ABSTRACT  JOURNALS 


1  BY  CA      3   BY  OTHER  ABSTRACT  JOURNALS 


*  INDICATES  SECONDARY  JOURNALS 


Figure  5 
Example  of  Inventory  of  CA  Abstracts  by  Source  in  Coden  Order 


-  ? 
3 


149 


•  2 

""  1 

<  * 

§  CD  < 

Z  *    *  *    *  *  *    *  **«•*  *          *    *    *  *          *  *•»***  o 

Of  -J 

.  j  S 

OUJ3MfiCUJU^OOTKX<ZH<li.u<£|UIONKXXl^~7ZXt/>KOXHA<liJV>XOV)XCO)CXHU  X 

»a<<<82<o-^GLXko£o^-ob)uae<i/)OZOZ<ou.(kXtt03«acujxa.-)X3<aeo<5z  vi 

>u.  S 

*  *"  -it)*^«h.a>o-il*^<lP-J>o-o-.<Mm*«ol-a>  2 


O 


O    >> 
H  h 


a. 

S 

S 
w 


150 


Number 
of  Journolt 


AMMtCAN  CHIMICAl  lOCIIFY 
CHfMICAl  AIITVACri  SMVfCf 

JOURNAL   PRODUCTIVITY 
IN    CHEMICAL  ABSTRACTS 


v,uw 
5,000 

1,000 

500 

too 

50 

l/V 

;- 

S 

X 

, 

^ 

x^^ 

^ 

^ 

s 

S 

9< 

/ 

>      102030405060708090101 

Ptrc«nt  of  Total  Abstracts 

Figure  7 

Four-Year   Cumulation   or   Semi -Log   Paper   of  Journal 
Productivity  of  Abstracts 
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222 
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(ABSTRACT  NO.)  (ABSTRACTOR! 

(REQUESTED   BY)  ( DATE )  (OUT  TO) 

NOT  YET  REC'D  Q  NOT  REC'D 

NOT  ON  SHELF  -  NOT  CHECKED  OUT  TO  STAFF  Q 

VOL  SECT.  NO.  PAGE 


Figure  9 
Library  Call  Slips 
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CAS  COOEN  DIRECTORY.  TITLE  SEQUENCE  PACE     1 

CODEN  TITLE 

ACHY        ABHANOLUNGEN  AUS  OEM  GESANTGEBIETE  DER  HYGIENE. 

AMEH        ABHANOLUNGEN  AUS  OEM  INSTITUT  FUR  MET ALLHUETTENIIESEN  UNO 
ELEKTROMETALLURGIE  OER  TECHNISCHEN  HOCMSCHULE,  AACHEN. 

AP«P        ABHANOLUNGEN  OER  PREUSSISCHEN  AKACEMIE  OER  KISSENSCHAFTEN. 
PHVSIKALISCH-MATHEMATISCHE  KLASSE 

ABM8        A.B.M.  BOLETIN  (ASSOCIACO  BRASILEIRA  OE  METAID  (SAO  PAULO)) 
A.B.M.  BOL.  ASSOC.  BRASIL.  METAIS  (SAO  PAULO) 

AVCM        ABRASIVE  AND  CLEANING  METHODS. 

APKR        ABRIDGED  SCIENTIFIC  PUBLICATIONS  FROM  THE  KODAK  RESEARCH 
LABORATORIES. 

ATPM        ABSTRACTS  OF  SCIENTIFIC  AND  TECHNICAL  PUBLICATIONS  FROM  THE 
MASSACHUSETTS  INSTITUTE  OF  TECHNOLOGY. 

BIOCHIMIE.  STUDII  SI  CERCETARI  DE  BIOCHIMIE 
ACCF        ACCION  FARMACEUTICA 

AQNR        ACQUISITIONS  MEDICALES  RECENTES.  JOURNEES  MEDICALES 
ANNUELLES  OE  BROUSSAIS  -  LA  CHARITE. 

ABIW        ACTA  BIOLOGICA  ET  MEOICA  GERMAICA.  SUPPLEMENT  BAND 

ACUJ        ACTA  ET  COMMENT ATI  ONES  IMPERIALIS  UNI VERSIT ATIS  JUR IEVENSIS 
(DORPATENSIS) 

ATUA        ACTA  ET  COMMENT AT  I  ONES  UNIVERSITAT IS  TARTUENSIS 
(OORPATENSIS)  A.*  MATHEMATICA.  PHYSICA.  MEOICA 

ATUC        ACTA  ET  COMMENT ATI  ONES  UNIVERSI TAT  IS  TARTUENSIS 
IDORPATENSISI  C.»  ANN ALES 

ALSM        ACTA  LEIOENSIA.   SCHOLAE  MEOICINAE  TROPICAE 

ALUM        ACTA  LITTERARUM  AC  SCIENTIARUM  REGIAE  UNI VERSIT ATIS 
HUNGARICAE  FRANCISCO- JOSEPH  IN AE.  SECTIO  MEDICORUM 

AMTO        ACTA  NIPPONICA  MEOICINAE  TROPICALIS. 

APAI        ACTA  PHYSIOLOGICA  ACADEMIAE  SCIENTIARUM  HUNGARICAE. 
SUPPLEMENTUM 

APVP        ACTA  PHYSIOLOGICA  POLONICA 

LITTERAE  SOCIETATIS  PHYSI OLOGORUM  POLONORUM 
ACTA  PHVSIOL.  POLOS. 

APXG        ACTA  PNYTOTAXONOMICA  ET  GEOBOTANICA. 

APQP        ACTA  PSIOUIATRICA  V  PSICOLO6ICA  DE  AMERICA  LATINA 


SEQ.  NO. 
000385  01 


00042S  01 
000429  02 


000570  01 


000700T01 
OO07OOTOI 


000730  01 


OOO76S  01 
000765  02 


OO1O9S  01 
001095  02 


001960  01 
001960  02 


002690  01 


002930  01 
002930  02 


003630  01 


004045  01 
00404S  02 


OO4050  01 
004OSO  O2 


004O5S  01 
OO40S5  02 


004890  Ol 
004890  02 

005770  01 

006530  01 
006530  02 

ooasaoToi 

006SSO  02 
006S80T01 

006670  01 
00701O  Ol 


Figure  11 
Sample  Page  of  Title  Sequence  of  CAS  Coden  Directory 
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List 
of 

Changes 

KfV 
/K 


Revised 

CAS  CODEN 

Directory 

Listing 


File  1-  Items  to  be  purged 

2-  Lift  of  words  to  be  printed 
in  all  lower  cast. 

3-  List  of  words  to  be  printed 
in  all  upper  case. 

4-  List  of  words  to  be  abbreviated 
per  ASA239.S 


Figure  12 
Flow  Chart  for  Revision  of  Present  CAS  Coden  Directory 
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File  Generation -COOEN  Additions 


Figure  13 
Flow  Chart  for  Preparation  of  Master  File  of  Periodicals  with  Coden 
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Checking  Edition  Preparation 


Match  ALA 

Cards  Against 

Title     File 


Figure  14 

Flow  Chart  for  Preparation  of  Checking  Edition  of  List  of 
Periodicals 
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Library  Holdings  Tabulation 


ALA 

Checklist 


(Libraries    \ 
Record 
Holdings    J 


Figure  15 

Flow   Chart  for   Adding   Library 

Holdings  of  Periodicals  to 

Computer  Record 
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Work  Flow  for  Periodicals  List 


/ 

/ 

Biblio- 

Serial 

List 

of 
Periodicals 

/ 

graphy 

Monograph 

(5X8) 

/ 

OK 


/Transcribe  onto\ 
I  Coding  Sheet    } 

~T 


|36O 

Merge 
Information 

Proof  Copy 

-  ^^ 

Make 
Corrections 


|36O 

Merge  in 
Library  Holdings 
Information 

Serials 
Listing 
w/  Holdings 

Figure  16 

Flow  Chart  for  the  Preparation  on  Tape  of  the  Final  Com- 
plete List  of  Periodicals 
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Prepare  Specialized   Listings 


Figure  17 

Flow  Chart  for  the  Preparation  of  Special  Lists  of 
Periodicals 


CHEMICAL  TITLES 
KEYWORD-IN-CONTEXT   INDEX 


WITH   AROMATIC    ALDEHYDES,    AND    THE    GEOMETRICAL 

WITH   ALIPHATIC    ALDEHYDES,    NALONIC    ESTERS,    AND 

Of    SETTING    FREE    THE    ALOEHYDIC    GROUP    IN    PYRIOOXAL.- 


JSOO- 
JUAR- 


ESTER 


AND   ORALLY    ADMINISTERED  ALDO    STERONE.*    EVIDENCE    FOR    EXTRA  JCIN- 

RIMSE-5-PHOSPMATE  ALOOLASE    IN   THE    SYNTHESIS   OF    OEOXY  6BRC- 

OF    METHYL    OOP*    I  ALOOHETI    ON   CARDIOVASCULAR    RESPONS  AIPT- 

TIVES    «    CONDENSATION   OF  ALOOSES    AND   THEIR    ALDEHYDRO   DERIVA  JPNS- 


-1966-0283 


0022-0062 
0159-0215 
00*5-0021 


Figure  18 
Sample  of  KWIC  Index  to  Chemical  Titles 
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CHEMICAL  JIHES 


BIBLIOGRAPHY 


J.  Clln.  lnv..r.,  43,  No.  1  (1966) 


SCHKCfOEH    f« 


JCIM-1-0<H«-026«      tLtOSOE 


isitno  or 

KO"OX[NID1S     G 

KDSCHOt    CB 
NflliKS     HK 

!?*-•• 

Figure  19 
Sample  of  Bibliography  Section  of  Chemical  Titles 


CHEMICAL  JULES 


AUTHOR  INDEX 


BUEDEN    HR         PMi.T-A-0019-0712 
PMlT-»-0019-0708 
•LOCH   0  ElCA-A-001 1-0093 

BLOEHBERGEN    N 

MtLT-Jt-0016-OOtl 
BLOHOUIST  AT  JOCE-A-0031-033B 
BLOODKORTH  »J 

JSOO-A-1966-0299 
tlOOM  H  AJCH-A-OOie-2039 

BlOOHf IEIO  C  JSfA-A-OOlT-0019 
6LOOHFIELD  0  JCIN-A-OMS-0264 
BIOSSEY  RC  T»MM-A-0236-00?8 


Figure  20 
Sample  of  Author  Index  to  Chemical  Titles 
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CHEM/CA/.-B/O/.OG/CAL  ACT/V/T/ES 

ILLUSTRATIVE  DIGEST 

J.  Blol.  CH«m.,  «•».,  No.  at  (1»««) 

Zanettl    G,    Fortl    G;Unlv.    Milan. 

Tha    kinetic    properties   of    a    spinach    LEAF    CHLORGPLAST    crystalll 
Flevoproteln    enzyeie,    both    at 

146145        FAD,    and    reductive    I  nee t Iva t Ion,    >ere    studied. 
259-1  Kinetic    reaction    studies    of    the    activity   of 

53576        TPHH, 

259-2  Tha  REDUCTION  of 

Ferrlcyanlde  by  the  enzys>e  gave  a  Ka  of  30.7  •lcro»olar  and  In 

to  for. 

269-3  The  DIAPHOKASE  ACTIVITY  of  the  enzyae  nas  reverslbly  Inactlvat 

prelncubat Ion  olth 
TPNH; 
TPN  preveh'ted  and  reversed  this  INACTI VATION. 

259-4       57136    Urea  Irreversibly  Inactivated  DlAPHORAbF  ACTIVITY  of  the  enzyi 
converted  the  reversible  tnactlvatlon  by 

2b9-5  The  activity  of 


H-C-OM 

H-C-OM  *       H-C-OH 
M-C 


H-C-OM  I 

H— J 


53576 


Figure  21 
Sample  Page  of  the  Digest  Section  of  CBAC 


CHEM/CAL-B/OLOG/CAL  ACTIVITIES 


KKYWORD-IN-CONTKXT  INDBX 
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253-7 

259 

387-3 

36 
387-6 

8 

288-1 
383 

38-10 
431 
431 
431-4 

38-9 
387-1 
387-1 


OCHONDRIA  fraction**   When  the 

properties  of  •  spinach  LEAF 

ORYLATION  decreased  to  zero  in 

STRUCTURAL  INTEGRITY  of  the 
ADP  PHOTOREDUCTION  by  Isolated 

bonate,  Isolated  whole  spinach 
REASED  PHOTOPHOSPHORYLATION  BY 
Y  MECHANISM  that  «ould  prevent 


d  In  c 

Pyr 

>t  80 


CHLOROPLAST  and  MITOCHONDRIA  fra 
CHLOROPLAST  crystalline  Flavopro 
CHLOROPLAST  fragments  fron  LEAVE 
CHLOROPLAST  STRUCTURE  AND  FUNCTI 
CHLOROPLAST,  permitting  the  fine 
CHLOROPLASTS  by  200-350*.*  »  p 
CHLOROPLASTS  by  -ashing  or  hypot 


•ed  a  ««all 
rom  CALLUS  past  th 
RON  CHINESE  CABBAG 


CHLOROPLASTS 
CHLOROPLASTS 
CHLOROPLASTS 
CHLOROPLASTS 
CHLOROPLASTS 
CHLOROPLASTS 
CHLOROPLASTS 
CHLOROPLASTS  fron  LEAVES  fl 


roai  functioning  al 
ro»  g roving  CALLUS 


Figure  22 
Sample  of  KWIC  Index  to  CBAC 


CME/MICAl-lfOlOGICAl  ACTIVITIES 

MOLBCULAtl  FOMMULA  IMDBX 

DIGEST   MOLECULAR  REGISTRY 
NUMBER    FORMULA    NUMBER 


20-4 
22-5 
77-11 

259-1 

297-3 

411 
82 

166-1 


Figure  23 
Sample  of  Molecular  Formula  Index  to  CBAC 
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CHEMICAL-BIOLOGICAL  ACTIVITES 
(CBAC) 


Journal 
Acquisition 

1 

Selection  of 
Papers 

1 

Digest  and 
Structure 
Preparation 

OWCST 

X7                   \ 

STRUCTURE 
"V 

Editing 

Registration 

1 

/ 

Keypunching 

\ 

Merge 

1 

Computer 
Processing     and 

Typesetting 

1 

Publication 
of   CBAC 

Figure  24 
Flow  Chart  for  the  Preparation  of  CBAC 
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Master  Random  Access  File 


Auxiliary  Information  Files 


Figure  25 
Flow  Chart  for  Input  and  Output  of  Master  Random  Access  File 
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Appendix  2 

CAS  Computer  Configurations 


1.  IBM  360  Model  40 

32  K  storage 

4        7 -channel  tape  drives  (30  KB  tapes) 

1  9 -channel  tape  drive 

2  printers 

2.  IBM  7010 

80  K  storage 

8        7 -channel  tape  drives 

1301  disk  file 

3.  Two  remote  terminals— IBM  1050' s 

4.  Increase  in  storage  capability  of  360  Model  40  to  128  K— 
March  15,  1966 

5.  Full  operating  system  for  the  360  Model  40,  disk  file  and 
floating  point  arithmetic— April  1966 


DATA  PROCESSING  IN  THE  TEXAS  A  &  M  UNIVERSITY  LIBRARY 


Bruce  W.  Stewart 


The  Texas  A  &  M  University  Library  embraced  automation  as  a 
way  of  life  when  it  became  the  first  library  in  the  Southwest  to  employ 
a  Data  Processing  Supervisor  as  a  full-time  Library  staff  member  in 
September,  1964.    The  creation  of  such  a  position  as  part  of  the  Li- 
brary staff  was  only  one  of  several  favorable  circumstances  which 
combined  to  provide  the  necessary  foundation  for  the  achievements 
outlined  in  this  paper. 

In  addition  to  an  enthusiastic  University  administration  which 
provided  requested  supplemental  funds  for  a  special  conversion  pro- 
ject, the  Library  has  access  to  the  University's  centralized  data  pro- 
cessing facility,  which  is  one  of  the  largest  such  University  instal- 
lations in  the  Southwest.    The  Data  Processing  Center  houses  an  IBM 
7094-1401  computer  system  with  14  magnetic  tape  drives,  two  sepa- 
rate off-line  1401  tape  systems  (one  with  a  1404  printer),  and  a  battery 
of  high  speed  sorters,  collators,  and  card  punches.    This  tremendous 
hardware  capability  has  proved  to  be  a  great  asset  to  our  automation 
program. 

A  second  asset  of  a  more  subjective  nature  is  the  Library's 
membership  in  a  co-operative  effort  involving  the  libraries  of  Texas 
A  &  M,  Rice  University,  and  the  University  of  Houston.    While  ninety 
miles  separate  College  Station  from  the  University  of  Houston  and 
Rice  in  Houston,  the  association  has  proved  very  beneficial  to  each 
library.    In  the  fall  of  1964  the  three  libraries  met  to  discuss  their 
mutual  interest  in  automation.    Since  available  computer  equipment 
varied  tremendously  with  each  University,  an  agreement  was  reached 
to  share  ideas  and  co-operative  effort  at  the  systems  and  design  level, 
with  meetings  to  be  arranged  roughly  three  times  per  year.    Each  li- 
brary reviewed  its  most  pressing  needs,  and  the  final  result  was  that 
the  University  of  Houston  Library  decided  to  tackle  the  acquisitions 
operation,  Rice  University  Library  was  already  preparing  to  automate 
its  circulation  operation,  and  Texas  A  &  M  elected  to  attack  the 
serials  control  problem. 


Bruce  W.  Stewart  is  Coordinator  of  Data  Processing  for  The  Library, 
Texas  A  &  M  University. 
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Probably  the  most  outstanding  result  of  the  co-operative  agree- 
ment, in  addition  to  the  strong  personal  relationships  that  developed, 
was  that  Texas  A  &  M  was  able  to  justify  and  implement  an  automated 
circulation  system  in  an  extremely  short  time  due  to  the  sharing  of 
both  experience  and  advice  of  the  Rice  Library.    Although  the  two 
circulation  systems  are  related  only  by  the  IBM  357  Data  Collection 
Units  used  by  each,  considerable  time  and  effort  were  saved  by  Rice's 
generosity.    With  implementation  of  Texas  A  &  M's  Automated  Serials 
Control  operation,  the  University  hopes  to  be  in  a  position  to  repay 
its  debt  to  the  co-operating  institutions. 

The  Texas  A  &  M  University  Library  is  wholeheartedly  involved 
in  data  processing  applications  and  the  automation  of  clerical  oper- 
ations.   While  not  concerned  immediately  with  the  more  sophisticated 
area  of  information  retrieval,  the  Library  looks  forward  to  the  time 
when  such  systems  will  become  a  reality.    Although  the  two  appli- 
cations just  mentioned,  serials  and  circulation,  represent  the  two 
major  areas  of  automation,  several  other  routine  functions  also  utilize 
punched  cards.    The  Library's  physical  property  inventory  is  main- 
tained with  a  punched  card  system  which  allows  rapid  reconciliation 
with  the  University's  inventory  records  at  the  close  of  each  fiscal 
year.    The  Acquisitions  Department  also  maintains  a  punched  card 
file  by  purchase  order  number,  coded  to  allow  rapid  breakdown  of 
outstanding  encumbrances  by  categories  such  as  books,  serials,  and 
others.    Finally,  Acquisitions  runs  an  annual  analysis  of  vendor  per- 
formance (primarily  the  state  contract  jobber  which  changes  from 
year  to  year)  by  punching  and  analyzing  on  an  IBM  1401,  data  which 
provides  the  percentage  of  books  delivered  by  five  day  increments 
and  the  percentage  of  books  delivered  for  each  discount  received. 


Automated  Circulation  System 


In  February  1965,  the  University  approved  a  supplemental  grant 
to  the  Library  in  response  to  a  proposal  which  outlined  and  justified 
the  conversion  necessary  to  implement  an  automated  circulation  sys- 
tem.   There  were  two  major  steps  involved  in  the  initial  conversion 
of  our  collection  for  the  automated  system,  viz.,  keypunching,  and 
gluing  pockets  and  inserting  cards. 

Of  the  Library's  half -million  volumes,  it  was  necessary  to  pro- 
duce IBM  bookcards  for  a  circulating  collection  of  approximately 
200,000  volumes.    This  represents  the  circulating  collection  in  the 
main  library  only;  branch  libraries  were  not  included  in  the  con- 
version or  in  the  plans  for  automation.    The  branches  will  be  absorbed 
into  a  new  $3,800,000  library  building  to  be  completed  in  1968;  at  that 
time  the  necessary  IBM  bookcards  will  be  keypunched  in  another 
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crash  project.  In  addition,  since  bound  serial  holdings  are  not  circu- 
lated, it  was  unnecessary  to  produce  a  machine  readable  bookcard  for 
the  additional  200,000  bound  serial  volumes  in  the  main  library  build- 
ing. 

After  several  methods  of  producing  the  necessary  machine 
readable  identification  for  each  circulating  volume  were  investigated, 
local  keypunching  under  the  Library's  own  management  was  selected 
as  the  most  practicable  approach.    Alternate  methods  considered  in- 
cluded optical  scanning  of  pencil  coded  sheets  and  contract  conversion 
by  a  private  firm.    Neither  of  these  possibilities  was  acceptable,  how- 
ever, either  because  of  simple  unavailability  of  the  service  desired 
or  because  of  the  problem  of  shipping  irreplaceable  shelflist  trays  to 
and  from  the  contract  keypunching  firm  ninety  miles  away.    The  de- 
ciding factor  in  the  final  decision  was  the  availability  of  the  Universi- 
ty Data  Processing  Center's  keypunches  on  a  second  shift  basis  at  no 
cost  to  the  Library. 

Five  girls  were  hired  at  $1.25  per  hour  to  work  4-1/2  hours 
each  weekday  night,  5:30  p.m.  to  10:00  p.m.,  plus  3-1/2  hours  on 
Saturday  morning.    The  girls  read  directly  from  the  shelflist  cards  in 
their  catalog  trays,  which  were  carried  to  the  Data  Processing  Center 
and  back  each  evening.    The  girls  typed  the  accession  number  and 
call  number  for  each  bound  volume  except  for  those  books  which  be- 
cause of  classification  number  or  shelf  card  notation  were  known  to 
be  non- circulating,  entering  an  average  of  25  keystrokes  per  card. 
Figure  1  illustrates  the  full  bookcard  format.    Only  one  of  the  girls 
was  an  experienced  keypunch  operator  although  all  were  capable 
typists.    By  the  end  of  the  first  week  they  were  averaging  1,000  cards 
per  night  per  girl  and  maintained  this  average  for  approximately 
seven  weeks,  the  time  required  to  complete  the  project.    Card  output 
was  limited  not  by  punching  speed  but  by  the  time  required  for  the 
girls  to  interpret  each  shelflist  card  according  to  prescribed  in- 
structions. 

The  keypunched  cards  were  kept  in  shelflist  order  and  periodi- 
cally listed  on  an  IBM  1401.    The  listings  were  then  sight-verified 
against  the  original  shelflist  cards  by  two  additional  girls  employed 
full-time  during  the  day  for  the  duration  of  the  project.    Error  cards, 
omitted  cards,  and  cards  for  non- circulating  volumes  were  pulled  or 
repunched.    All  original  cards,  which  were  punched  in  regular  blank 
card  stock,  were  then  input  to  an  IBM  1401  program  which  scanned 
the  fields,  adding  fill  periods  and  357  control  punches,  and  then 
punched  the  final  card  into  the  Library's  Master  Bookcard  stock. 
This  program  also  printed  each  volume's  accession  number  on  a 
gummed  1/4"  by  1-1/4"  label. 

The  final  master  cards,  kept  in  shelflist  order  in  labeled  boxes 
with  the  corresponding  gummed  labels,  were  then  run  through  an  IBM 
557  interpreter.  Costs  for  keypunching  and  verifying  were  as  follows: 
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Wages 

Keypunching:    March  22  -  May  11,  1965 

Sight  verifying:    April  5  -  May  28,  1965  $1800 

Supplies 

200,000  Master  Bookcards  $  410 

217,000  Stock  Tab  Cards  $  255 

200,000  Pocket  Labels  $  260 

Non-billed  Operations 

Listing  on  IBM  1401  for  verification  15  hours 
Reproducing  for  Master  Card  on  IBM 

1401  20  hours 

Interpreting  on  IBM  557  40  hours 

The  book  collection  lacked  pockets  which  were  glued  in  by  high 
school  students  at  $.75  per  hour.    The  boys  worked  from  8  a.m.  to 
5  p.m.  in  unair conditioned  stacks  during  the  summer  months.    Work- 
ing in  teams  of  two,  ten  boys  averaged  7500  books  per  day.    The  glu- 
ing operation  was  virtually  complete  in  five  weeks. 

These  students  plus  an  additional  five  people  were  also  utilized 
to  match  the  Master  Bookcard  to  the  book,  insert  it  in  the  pocket,  and 
attach  the  accession  number  label  to  the  pocket.    The  insertion  oper- 
ation was  in  simultaneous  operation  by  the  second  week  in  June  and 
continued  to  August  20.    Most  of  September  was  also  spent  with  one 
regular  library  staff  member  working  full-time  on  the  backlog  of 
snags. 

Costs  for  the  gluing  and  insertion  operation  were  as  follows: 

Wages 

Gluing  Pockets:    June  1  -  July  7,  1965 

Match  and  Insert:    June  15  -  August  20,  1965  $4600 

Supplies 

200,000  Bookpockets  $991 

5  Gluing  Machines  $     55 

30  gallons  glue  $  105 

Roughly,  the  total  conversion  cost  was  approximately  3.9jd  per 
book  and  the  project,  start  to  finish,  took  five  months. 

In  addition  to  the  conversion  of  the  collection  it  was  necessary 
for  each  Library  patron  to  have  a  plastic  card  with  borrower  identifi- 
cation in  machine  readable  form  punched  into  it.    The  Registrar's 
office  already  provided  a  student  identification  card,  which  was  modi- 
fied to  IBM  357  system  specifications.    This  card  is  custom  laminated 
by  the  University  and  embossed  in  the  Registrar's  office  with  student 
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permanent  number,  name,  and  birth  date.  Each  currently  enrolled 
student  receives  a  new  card  at  the  beginning  of  each  fall  semester, 
which  provides  automatic  yearly  validation. 

All  other  non- student  users,  including  University  faculty  and 
staff,  faculty  and  staff  family,  and  local  area  and  special  users  are 
issued  a  permanent  factory-produced  card  purchased  by  the  Library. 
These  are  non-embossed  cards  on  which  the  name  and  user  number 
are  handwritten.    All  non- student  users  are  required  to  provide  their 
Social  Security  number  as  identification.    A  library  user  thus  either 
presents  his  student  ID  card  to  be  punched  or  requests  a  Library 
Card  the  first  time  he  enters  the  Library  during  the  year.    The 
patron's  name,  mailing  address,  and  identification  number  are  re- 
corded for  keypunching  and  addition  to  a  magnetic  tape  file  of  names 
and  addresses.    The  punched  ID  card  is  immediately  returned,  and 
the  patron  is  ready  to  charge  his  book.    The  six  digit  student  identifi- 
cation number  is  recorded  with  three  trailing  zeros,  and  an  additional 
prefix  number  is  assigned  to  both  Social  Security  and  student  numbers 
to  indicate  the  status  of  the  borrower  for  later  analysis  of  Library 
usage. 

The  data  processing  equipment  in  the  Library  consists  of  an 
082  Sorter,  an  085  Collator,  an  026  Keypunch,  an  013  Badge  Punch, 
and  the  IBM  357  Data  Collection  System.    The  IBM  357  system  in- 
cludes two  model  six  input  stations  with  cartridge  manual  entry, 
mounted  at  the  Circulation  Desk  on  either  side  of  a  358  control  unit 
and  receiving  keypunch  which  are  enclosed  in  a  soundproof  plywood 
box  built  into  the  desk.    The  regular  026  Keypunch  services  the  needs 
of  other  Library  departments  upon  request  as  well  as  all  regular 
processing  of  new  acquisitions  and  other  Circulation  Department  key- 
punching. 

A  brief  description  of  operation  of  the  IBM  357  system  is  as 
follows:    A  patron  presents  a  book  to  be  checked  out  with  a  valid  cur- 
rent identification  card.    The  Circulation  clerk  selects  the  correct 
date  due  cartridge  to  be  used  for  the  charge  and  inserts  the  bookcard, 
identification  card,  and  cartridge  to  activate  the  IBM  357  input  sta- 
tion.   The  output  keypunch  produces  two  cards,  each  containing  book 
identification,  borrower  identification,  and  date  due  (see  Figure  2). 
The  two  cards  are  uniquely  coded  in  cc  80.    The  first  card  is  placed 
in  the  pocket  with  the  bookcard  as  proof  of  charge  and  as  notification 
to  the  borrower  of  date  due.    The  second  card  remains  on  the  key- 
punch stacker,  and  the  accumulated  charge  cards  are  input  to  the 
nightly  computer  processing  run. 

As  books  are  returned,  they  are  held  on  a  booktruck  with  their 
corresponding  return  cards.    Immediately  before  the  truck  is  released 
for  shelving,  the  return  cards  are  sorted  into  accession  number  order 
and  matched  with  the  collator  against  a  deck  containing  one  card  (see 
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Figure  3),  for  each  book  on  Hold.    The  Hold  cards  are  likewise  main- 
tained in  accession  number  order  and  contain  only  the  accession  num- 
ber in  machine  readable  form.    Any  matching  cards  that  merge  flag 
books  that  are  to  be  removed  from  the  truck  before  it  is  released  to 
the  stacks.    While  a  low-volume  operation,  the  sorter  and  collator  in 
the  Library  are  necessary  even  in  a  computer-based  system  such  as 
this  one,  not  only  for  Circulation  processing  but  for  various  other 
IBM  card  filing  routines  that  can  be  done  more  conveniently  on  a  me- 
chanical basis. 

A  manual  back-up  charging  procedure  is  available  in  the  event 
the  IBM  357  system  breaks  down,  or  in  the  event  a  book  is  presented 
at  the  Circulation  Desk  with  the  Master  Bookcard  missing  or  muti- 
lated.   The  procedure  utilizes  a  three-part  IBM  card-size  form  (see 
Figure  4),  which  is  used  manually  to  record  book  identification,  bor- 
rower number,  and  date  due.    One  part  then  goes  with  the  book  in 
place  of  the  Return  Card,  one  part  is  filed  as  a  temporary  record  of 
charge,  and  the  third  part  is  sent  to  the  keypunch  operator  to  have 
the  required  Master  Card  or  Charge  and  Return  Card  set  keypunched. 
The  Charge  Card  thus  produced  is  entered  into  the  day's  accumulation 
of  charge  cards  and  processed  normally  by  the  computer.    The  Master 
Bookcard  or  Return  Card  is  filed  manually  to  await  the  return  of  the 
book.    All  returns  containing  the  temporary  return  form  must  have 
the  Bookcard  or  Return  Card  waiting  in  the  file  pulled,  and  then  must 
be  processed  through  the  normal  Hold  procedure. 

The  Library  believes  that  a  book  should  either  be  in  its  proper 
place  on  the  shelf,  or  it  should  be  reflected  by  a  charge  to  a  person 
or  a  location.    Location  charges  are  achieved  by  some  fifteen  special 
plastic  charge  cards  kept  in  the  Circulation  Office  which  are  punched 
with  special  numbers  for  "Reserve,"  "Bindery,"  "Interlibrary  Loan," 
etc.    All  charges  are  maintained  in  one  circulation  file  in  call  number 
sequence  so  that  location  and  patron  charges  are  interfiled  for  easy 
reference  on  the  circulation  listing. 

One  additional  feature  of  the  system  is  worth  special  note.    The 
use  of  an  accession  or  item  number  unique  to  each  volume  permits  a 
thief -proof  door  check  system.    In  a  "two -card"  system  such  as  this, 
all  information  is  machine  recorded:    book  identification,  borrower 
identification,  and  date  due.    No  manual  stamping  or  validating  is 
required.      The  accession  number  is  stamped  both  in  the  book  and  on 
a  pocket  during  initial  processing,  and  this  pocket  is  then  glued  firm- 
ly in  the  book.    The  accession  number  labels  which  were  computer 
printed  during  the  conversion  period  also  provide  the  item  number 
for  back  holdings.    The  Library  Doorman  thus  matches  item  number 
on  the  pink  Return  Card  and  item  number  on  the  pocket  as  proof  of 
valid  charge.    Other  information,  including  date  due,  need  not  be 
checked.    The  unique  item  number  guarantees  the  current  validity  of 
the  charge  and  also  prevents  the  common  dodge  of  charging  a  book 
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and  then  using  the  Return  Card  in  a  second  book  to  get  it  past  the  door 
check. 

Regular  nightly  operation,  which  has  continued  without  interrup- 
tion since  October  1965,  proceeds  as  follows.    The  Circulation  De- 
partment delivers  to  the  Data  Processing  Center  by  10:30  each  night 
except  Sunday  the  accumulated  Charge  and  Return  cards  in  random 
order.    These  updates  are  sorted,  edited,  and  processed  by  the  IBM 
7094,  and  a  new  listing  of  books  in  circulation  (see  Figure  5)  is  print- 
ed on  a  two  part  paper.    Error  messages,  daily  statistics,  and  a  list- 
ing of  extreme  overdue s  immediately  follow  on  the  listing  (see  Fig- 
ure 6).    First  and  second  overdue  notices  and  fine  notices  (see  Fig- 
ure 7)  are  printed  on  continuous  form  postcards  with  addresses  ready 
for  mailing.    The  update  cards  and  printed  outputs  are  returned  to  the 
Library  by  8:15  each  morning.    The  two  copies  of  the  circulation  list- 
ing are  bound  unburst  and  placed  on  the  desk  for  patron  use,  and  the 
postcards  are  separated  and  mailed.    One  copy  of  the  error  messages 
and  statistics  is  retained  by  Circulation  for  further  action,  and  the 
second  copy  is  forwarded  to  the  Data  Processing  Supervisor  for  in- 
spection. 

Currently,  there  are  about  10,000  charge  records  on  tape  and 
probably  an  average  of  700  updates  (charges  and  returns)  for  each 
day.    The  nightly  run  requires  less  than  three  minutes  of  IBM  7094 
time,  plus  ten  to  fifteen  minutes  of  off-line  printing  on  a  600  1pm  IBM 
1403.    The  program  is  written  entirely  in  COBOL  except  for  an  as- 
sembly language  core  sort  library  subroutine.    The  program  was 
written  and  checked  out  in  approximately  one  month's  time  early  in 
the  fall  of  1965.    The  most  recent  modification,  however,  was  made 
in  February  1966;  it  seems  that  odd  situations  which  were  not  pro- 
vided for  in  the  program  will  continue  to  occur. 

The  total  cost  for  rental  of  the  data  processing  equipment  locat- 
ed in  the  Library,  less  various  standard  educational  discounts,  is 
$355  per  month.    A  flat  charge  of  $400  per  month  is  paid  to  Data  Pro- 
cessing for  use  of  machine  time,  resulting  in  a  yearly  cost  of  $9,060. 
The  continuous  forms  for  the  printed  listing  and  the  postcard  stock, 
plus  the  other  special  card  forms  needed,  result  in  an  additional  cost 
of  approximately  $1,000  per  year. 

While  the  automated  circulation  system  can  be  justified  by  vir- 
tue of  performing  at  least  the  work  of  the  approximately  three  and 
one -half  clerks  represented,  the  advantages  of  the  mechanized  oper- 
ation over  any  traditional  manual  system  cannot  be  over-emphasized. 
The  system  now  provides  better  control  than  was  ever  possible  under 
the  old  manual  system.    At  the  same  time,  it  accommodates  pat- 
rons by  providing  rapid,  easy  charging  of  books  and  by  keeping  an  up- 
to-date,  error  free  record  of  the  location  of  books  charged  out. 

Elimination  of  manual  handling  and  filing  of  records  as  well  as 
the  tremendous  saving  of  time  effected  by  automatic  processing  of 
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overdue  charges  and  delinquent  returns  has  freed  the  circulation  staff 
to  assume  additional  duties  while  still  maintaining  almost  immediate 
reshelving  of  books  as  they  are  returned.    Perhaps  most  significant 
of  all  is  the  fact  that  the  present  system  can  accommodate  double  or 
triple  the  present  volume  of  circulation  with  only  a  minimum  increase 
in  personnel.    The  automated  system  also  offers  great  promise  in  aid- 
ing investigation  of  library  usage  at  Texas  A  &  M  by  analysis  of  ac- 
cumulated charge  data  in  conjunction  with  the  Registrar's  machine 
readable  student  records. 


Automated  Serials  Control 


The  Library's  decision  to  attempt  automation  of  the  Serials 
Record  operation  was  made  early  in  1965.    At  that  time  a  report  out- 
lining basic  concepts  of  the  system  was  prepared,  based  on  the  re- 
sults of  a  Master's  thesis  investigation.!    Initial  programming  and 
experimental  operation  of  this  proposed  system  were  delayed,  how- 
ever, until  January  1966  because  of  the  higher  priority  assigned  to 
the  completion  of  the  design  and  implementation  of  the  automated 
circulation  system  just  described.    When  completely  operational,  the 
automated  serials  system  will  perform  three  major  tasks:    (1)  order- 
ing and  renewing  subscriptions  and  recording  payment  for  each, 

(2)  recording  arrivals  and  flagging  delinquent  arrivals,  as  well  as 
keeping  track  of  all  serial  holdings  to  date  in  the  collection,  and 

(3)  providing  the  information  required  for  more  efficient  control  and 
analysis  of  operations.    In  addition,  the  data  in  the  computer's  records 
can  be  used  to  produce  listings  in  various  formats  for  use  in  public 
service  areas.    Such  a  mundane  use  alone  of  the  stored  data  can  al- 
most justify  the  conversion  to  machine  readable  form. 

Basic  Concepts.    The  basic  design  concept  of  the  data  process- 
ing system  which  supports  the  serials  operation  is  in  some  aspects  a 
departure  from  traditional  approaches.    Historically,  data  processing 
techniques  have  been  developed  to  accommodate  what  was  considered 
a  basic  hardware  limitation.    This  limitation  may  be  described  as  the 
necessity  for  conserving  data  storage  and  processing  time  at  nearly 
any  cost.    This  concept  dates  from  the  early  years  of  card  oriented 
computer  and  unit  record  equipment  systems.    In  the  past,  relatively 
slow  interval  operating  speeds  and  relatively  slow  access  to  mass 
data  storage  dictated  heavy  coding  and  extreme  abbreviation  of  data 
for  applications  involving  extensive  computer  processing.    The  result 
has  been  data  processing  systems  which  in  reality  were  designed  to 
accommodate  the  machines  involved  and  not  the  personnel  who  must 
ultimately  handle  and  analyze  the  information  produced. 
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Current  sophistication  of  computer  hardware  and  the  promise 
of  additional  technological  advance  dictate  that  completely  efficient 
hardware  utilization  is  no  longer  the  prime  measure  of  effectiveness. 
Modern  high  speed  computers  with  practically  unlimited  rapid  access 
data  storage  capability  allow  feasible  design  of  data  processing  sys- 
tems suitable  to  the  needs  of  the  application  as  well  as  the  personnel 
involved.    While  relatively  inefficient  internal  processing  procedures 
may  be  called  for,  the  resulting  performance  of  the  system  as  a 
whole  is  the  criterion  that  must  be  paramount.    Input  and  output  of 
information  and  data  handling  procedures  are,  therefore,  organized  to 
accommodate  the  library  personnel  involved.    Data  are  stored  in  the 
computer  uncoded  and  of  variable  length  wherever  possible,  especially 
when  coding  and  decoding  would  become  cumbersome  for  clerical 
personnel.    Stored  in  this  form,  the  data  can  also  logically  be  utilized 
by  or  converted  to  the  format  required  for  nearly  any  conceivable 
processing  system. 

The  Serials  Department  personnel  will  be  involved  as  little  as 
possible  in  data  processing  lore  or  systems  of  special  abbreviations 
and  other  confusing  requirements.    The  automated  system  is,  there- 
fore, divided  logically  into  two  subsystems:   the  clerical  system  and 
the  data  processing  system.    Only  very  limited  familiarity  with  the 
computer  system  is  required  on  the  part  of  library  personnel,  and  the 
two  systems  meet  only  through  the  interface  required  to  convert  data 
to  machine  readable  form.    It  should  also  be  understood  in  consider- 
ing the  approach  outlined  in  this  paper  that  the  system  is  designed  as 
a  compromise  between  the  ideal  environment  of  a  random  store  of 
data  accessible  on  a  real-time  basis  and  the  actual  fact  of  presently 
available  equipment.    While  the  eventual  goal,  based  on  expected  future 
hardware  delivery  is  the  more  sophisticated  operation  just  mentioned, 
the  system  is  planned,  for  the  present,  to  provide  the  most  efficient 
operation  possible  utilizing  the  capabilities  of  a  computer  located  at 
a  physically  remote  facility  with  only  limited  processing  time  avail- 
able for  library  operations. 

File  Organization.    Each  item  of  data  must  be  uniquely  identifi- 
able and  individually  retrievable.    The  master  tape  file  is  composed 
of  a  master  unit  record  for  each  entry.    Desired  items  of  data  will 
be  grouped  in  various  printed  outputs  for  use  as  working  files  by 
Serials  Department  personnel.    The  master  tape  record  for  each  en- 
try is  grouped  logically  into  several  sections:    publishing  and  fre- 
quency data,  financial  and  ordering  data,  bound  holdings  data,  and 
bibliographic  data,  for  example.    In  addition,  some  master  records 
will  contain  only  one  section,  cross  reference  data,  to  the  exclusion 
of  all  others.    This  method  of  processing  also  requires  that  each  copy 
of  a  serial  subscription  be  represented  by  a  separate  master  record 
with  the  appropriate  logical  sections. 
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Each  master  record  is  identified  by  a  unique  six-digit  serial 
identification  number  (SIN).    This  number  is  assigned  to  the  master 
records  so  that  numerical  arrangement  by  SIN  will  produce  an  alpha- 
betical arrangement  of  entries  in  the  Serials  Record.    In  addition  to 
its  use  as  an  alphabetical  sequence  number,  the  SIN  serves  as  the 
unique  sequential  processing  number  required  for  a  magnetic  tape 
file  application.    The  SIN  was  initially  assigned  at  intervals  of  100  to 
allow  future  insertion  of  new  entries  in  proper  sequence.    Whenever 
an  individual  interval  becomes  full,  the  computer  renumbers  the  en- 
tire master  file,  re-establishing  the  required  interval. 

Individual  data  elements  are  stored  and  processed  by  the  com- 
puter as  both  fixed  and  variable  length  alphanumeric  character 
strings.    A  system  reserved  symbol,  the  dollar  sign,  is  used  to  indi- 
cate the  length  of  the  variable  length  strings  and  is  the  only  character 
not  available  for  use  within  a  string.    The  character  strings  are 
uniquely  identified  by  the  SIN  and  a  keyword  which  has  a  high  mne- 
monic value  for  use  by  Serials  personnel. 

The  keyword  is  a  ten-digit  maximum  English  language  word  used 
to  identify  individual  character  strings.    There  is  also  a  special  class 
of  keywords  that  do  not  have  character  strings  attached  but  are  used 
to  set  flags  for  control  purposes.    Allowable  keywords  (see  Figure  8) 
comprise  a  Serials  Record  "dictionary"  which  is  open-ended.    Ad- 
dition of  new  data  elements,  for  example,  may  be  provided  by  simple 
modification  of  the  program's  keyword  table.    Keywords  for  each 
logical  record  section  of  the  Master  Record  are  additionally  divided 
into  two  groups:    required  keywords  and  optional  keywords.    Exten- 
sive editing  is  built  into  the  program  and  all  required  keywords  must 
be  present,  for  example,  for  the  particular  record  section  to  be  ac- 
cepted.   Presence  or  absence  of  optional  keywords  is  governed  by  the 
nature  of  the  individual  entry. 

An  additional  open-ended  feature  is  the  ability  to  introduce  an 
English  language  "note"  into  the  master  file  whenever  necessary. 
The  keyword  XNOTE  is  used,  where  "X"  represents  the  first  charac- 
ter of  the  name  of  the  logical  record  section  in  which  the  note  is  to  be 
included.    This  feature  is  the  fail-safe  measure;  if  all  else  fails,  hope- 
fully the  file  can  contain  a  short  note  to  the  effect  that  manual  pro- 
cessing is  the  preferred  course  of  action.    The  master  records  are 
maintained  in  sorted  keyword  within  SIN  order.    Bound  and  unbound 
holdings  record  sections  are  additionally  sorted,  however,  by  the 
major  sequence  number  for  the  particular  title  as  included  in  the 
bound  character  string.    Holdings  data  thus  also  require  interpretation 
of  the  character  string  for  unique  identification. 

Input  Requirements.    Input  to  the  computer  system  is  presently 
in  the  form  of  punched  cards.    The  serial  identification  number  (SIN) 
occupies  cc  1-6.    The  keyword  (KYWD)  occupies  cc  15-28.    Card 
columns  13  and  14  are  used  for  sequence  numbering  (SEQ)  of  input 


177 

cards  when  variable  length  character  strings  extend  beyond  one  card. 
Card  columns  25-80  contain  the  character  string  (DATA).    Card 
columns  7-12  contain  a  six  character  maximum  operation  codeword 
(OP).    This  codeword  instructs  the  computer  program  to  perform  one 
of  several  operations  such  as  ADD,  DELETE,  or  UPDATE. 

Clerical  procedures  require  data  collection  forms  similar  to 
that  shown  in  Figure  9.    Data  are  recorded  in  a  manner  familiar  to 
clerical  personnel  whenever  feasible.    Months,  for  instance,  are 
entered  as  common  three  letter  abbreviations  and  converted  to  the 
numeric  representation  required  for  calculation.    Figure  10  is  a 
sample  listing  of  input  cards. 

As  soon  as  practicable,  a  paper  tape  typewriter  will  be  used  in 
the  Serials  Department  instead  of  the  hand  written  data  collection 
form.    The  typed  copy  will  serve  to  verify  data  to  be  entered  in  the 
computer  file  and  will  serve  as  the  necessary  audit  trail  until  the 
master  file  is  updated.    The  punched  paper  tape  produced  simultane- 
ously will  then  be  used  as  direct  input  instead  of  punched  cards. 
Availability  of  a  keypunch,  however,  will  always  be  essential  for  pro- 
cessing punched  card  arrival  files  for  current  receipts. 

Output.    The  output  can  be  divided  into  two  categories:   punched 
and  printed.    Printed  outputs  include  both  the  working  files  for  the 
Serials  Department  as  well  as  the  various  listings  for  public  service 
areas.    The  working  files  will  be  produced  monthly  and  will  consist 
of  four  physical  files:    (1)  subscription  information,  (2)  bound  hold- 
ings and  bibliographic  data,  (3)  current  and  unbound  items,  and  (4)  a 
want  list.    The  bound  holdings  file,  for  example,  will  describe  each 
bound  volume  individually,  indicating  accession  number,  inclusive 
dates,  and  published  volumes  included.    The  computer  will  produce  an 
item  by  item  shelflist  record  for  inventory  control  by  the  Serials  De- 
partment (see  Figure  11)  while  any  holdings  information  produced  for 
public  service  areas  will  probably  be  printed  as  inclusive  holdings  in 
the  form  of  a  union  list.    In  addition  to  regularly  scheduled  output,  the 
number  of  special  reports  and  statistical  listings  which  can  be  easily 
produced  from  the  machine  records  is  limited  only  by  the  ingenuity  of 
the  programmer  and  the  amount  of  storage  space  in  the  Serials  De- 
partment. 

Direct  subscription  renewals  will  be  initiated  by  a  monthly  list- 
ing indicating  those  titles  requiring  attention.    The  annual  list  of 
domestic  serials  required  for  bidding  purposes  in  selection  of  a  state 
contract  jobber  will  also  be  printed  on  continuous  multilith  masters 
and  reproduced  in  the  required  number  of  copies.    The  possibility  of 
completely  automatic  subscription  renewal  with  computer  printed 
purchase  orders  seems  remote,  however,  considering  the  complexity 
of  existing  state  regulations. 

The  punched  card  output  is  used  for  inventory  control  and  check- 
in  of  expected  arrivals.    At  the  beginning  of  each  month,  one  card  is 
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produced  for  each  item  expected  during  the  month.    Prepunched  into 
each  card  will  be  the  serial  identification  number,  the  operation  code- 
word UPDATE,  the  keyword  UNIT,  and  a  description  of  the  serial  is- 
sue.   Information  printed  on  the  arrival  card  for  use  by  clerical  per- 
sonnel will  include  this  date,  plus  all  other  information  required  to 
check-in  and  distribute  the  issue.    This  information  may  also  include 
an  optional  forty  character  note,  such  as  "ROUTE"  or  "CHECK 
LOOSE  INSERTS."    As  the  card  is  produced,  the  SIN  punched  into  the 
card  is  checked  to  insure  that  the  correct  information  is  being  printed 
on  the  card. 

Arrival  cards  are  maintained  in  two  separate  card  files:    cards 
for  expected  arrivals  and  cards  for  received  arrivals.    Clerical  pro- 
cedure involves  matching  serial  issues  and  marking  them  for  distri- 
bution, as  well  as  the  transfer  of  the  corresponding  arrival  card  to 
the  received  file.    Library  files  thus  show  serial  holdings  as  issues 
expected  or  issues  received  in  the  card  files,  as  issues  recorded  in 
the  unbound  holdings  record,  as  bound  volumes,  or  as  missing  items 
appearing  in  the  want  list. 

One  arrival  card  will  be  produced  for  every  anticipated  indi- 
vidual item.    An  index,  for  example,  will  have  a  separate  arrival  card 
if  it  arrives  separately,  or  it  will  be  indicated  on  the  arrival  card  for 
the  issue  in  which  it  is  bound.    Predictable  arrivals  of  supplements 
will  be  indicated  by  separate  arrival  cards.    Each  copy  of  a  serial 
subscription  would  also  require  separate  arrival  cards.    A  serial 
subscription  for  two  copies  with  index  separate,  for  example,  would 
require  four  arrival  cards  for  the  last  issue  (see  Figure  12). 

All  active  subscriptions  will  be  coded  as  regular  or  irregular 
arrivals.    A  regular  subscription  is  by  definition  any  predictable  pat- 
tern of  publication  for  which  arrival  cards  can  be  produced.    All  other 
irregular  serials  with  completely  unpredictable  frequency  will  be 
indicated  by  a  printed  list.    When  received,  these  unexpected  items 
will  require  manual  keypunching  of  an  arrival  card  to  update  the 
master  file.    Claiming  of  delinquent  arrivals  will  be  aided  by  in- 
spection of  the  arrival  cards  remaining  in  the  file  at  the  end  of  each 
month.    The  decision  to  claim  missing  material,  however,  must  be 
based  on  the  knowledge  and  judgment  of  an  experienced  staff  member. 
"Claimed"  cards  are  flagged  and  returned  to  the  expected  arrivals 
file. 

A  second  punched  output  planned  is  a  bindery  "tickler"  card. 
This  card  will  be  produced  in  a  preassigned  month  so  that  the  binding 
load  is  staggered  evenly  over  a  twelve -month  period.    The  bindery 
clerk  will  receive  a  card  at  the  beginning  of  each  month  for  each  title 
to  be  bound.    The  card  will  list  receipts  as  recorded  by  the  computer. 
These  cards  constitute  a  "tickler"  file  only;  if  visual  inspection  indi- 
cates missing  issues  or  insufficient  bulk  for  binding,  the  card  will  be 
refiled  until  the  missing  material  is  received  or  until  the  next  bindery 
card  for  the  title  is  produced. 
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Serial  issues  are  normally  bound  into  physical  volumes  of  a 
size  to  "fit  the  hand."    The  decision  to  bind  a  given  number  of  issues 
into  a  physical  volume  must  be  based  on  the  judgment  of  the  bindery 
clerk.    In  addition,  physical  size  of  individual  issues  may  change 
drastically  and  unexpectedly,  requiring  an  exception  to  the  normal 
routine  frequency  of  binding.    The  backlog  of  cards  in  the  bindery  file 
will  also  aid  the  clerk  in  estimating  the  backlog  of  unbound  material 
in  the  collection. 

When  fully  implemented,  the  computer  system  will  automatically 
compile  monthly  operating  statistics  in  addition  to  routine  maintenance 
of  the  master  file  just  described.    The  system  is  being  programmed 
for  an  IBM  7094-1401  combination,  which  includes  a  model  1404 
printer  off-line.    All  non-1401  programs  are  being  written  in  COBOL 
whenever  possible  to  insure  compatability  both  for  the  expected  ex- 
pansion to  IBM  system  360  equipment,  and  for  the  different  computer 
configurations  at  Rice  University  and  the  University  of  Houston. 

The  computer  facility  is  made  available  to  other  University  de- 
partments by  the  Data  Processing  Center  at  a  reduced  rate,  and  the 
Center  operates  as  a  service  facility  for  University  operations  as 
well  as  serving  as  an  educational  facility.    For  present  purposes  the 
Library's  need  is  paramount,  and  no  attempt  is  being  made  to  present 
a  solid  economic  justification  for  the  system  which  is  being  developed. 

Present  Accomplishments.    In  preparation  for  the  approaching 
conversion  of  the  Serials  Record,  a  Serials  Data  Processing  Clerk 
was  employed  as  a  full-time  staff  member  in  the  fall  of  1965.    This 
young  lady,  working  under  the  direction  of  the  Serials  Librarian,  acts 
as  the  interface  between  the  sometimes  unintelligible  serials  record 
files  and  the  keypunch  operators  by  re-ordering  and  organizing  the 
information  on  a  form  suitable  for  keypunching. 

Any  data  input  to  a  machine  records  system  must  be  complete 
and  accurate,  particularly  bound  and  unbound  holdings  information 
used  for  inventory  control.    To  achieve  this  lofty  goal  the  Library 
also  employed  last  fall  a  crew  of  full-time  clerks  to  begin  a  physical 
inventory  of  holdings  to  verify  the  existing  but  unreliable  records. 
Holdings  information  will  be  converted  to  machine  readable  form  only 
after  being  verified  by  the  inventory  crew.    It  is  expected  that  this 
project  will  last  at  least  another  year. 

The  conversion  of  other  Serials  Record  data  to  machine  read- 
able form  was  begun  in  February  1966.    Initially,  a  serial  identifi- 
cation number,  location,  call  number,  title,  and  the  control  keywords, 
SERIAL  or  PERIODICAL  and  ACTIVE  or  INACTIVE  were  coded  for 
each  of  the  approximately  7000  entries  in  the  serials  holding  file  and 
transferred  to  a  magnetic  tape  file.    All  cross  references  were  also 
keypunched  and  added  to  the  file.    In  addition,  the  necessary  pro- 
cedure was  established  in  the  Serials  Department  to  insure  that  title 
changes  and  new  subscriptions  would  henceforth  be  keypunched  so 
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that  an  updated  file  could  be  maintained.    All  keypunching  is  being 
done  as  much  as  possible  by  the  regular  keypunch  operator  during  the 
day,  who  is  also  responsible  for  keypunching  for  the  Circulation  De- 
partment as  well  as  all  other  library  work.    A  second  keypunch  oper- 
ator is  employed  part-time  on  a  second-shift  basis  to  cope  with  the 
volume  of  data  waiting  to  be  keypunched. 

Two  listings  were  immediately  produced  from  the  data.    The 
first  was  what  might  be  called  a  Union  List  of  Serial  Titles  for  Texas 
A  &  M  University  Library.    No  holdings  information  whatsoever  is 
reflected,  but  all  cross  references  are  included.    This  listing,  which 
up  to  now  has  been  printed  only  once  on  Multilith  masters  and  repro- 
duced for  campus  distribution,  will  be  reprinted  only  as  the  need 
dictates. 

The  second  listing  includes  only  current  periodical  subscrip- 
tions by  current  title;  no  cross  references  are  provided.    This  listing 
contains  the  approximately  3500  titles  which  are  available  in  the  cur- 
rent periodical  reading  rooms  in  both  main  and  branch  libraries,  and 
in  the  main  library  Reference  Department.    It  is  available  in  several 
copies  in  public  service  areas  in  both  main  and  branch  libraries  and 
is  reprinted  presently  on  a  twice  monthly  basis  (see  Figure  13). 

A  little  imagination  will  suggest  numerous  other  uses  for  the 
presently  limited  amount  of  data  now  available  in  machine  readable 
form.    One  can  get  simple  statistical  counts  by  selected  character- 
istics or  listings  arranged  by  call  number  or  location,  or  combinations 
of  the  above.    The  question  is,  Do  we  really  want  this  particular 
breakdown  at  this  point?    There  is  a  temptation  to  bury  the  operation 
in  mounds  of  needless  printouts  of  unwanted  information. 

The  next  step  is  the  complete  conversion  of  financial  and  sub- 
scription information,  as  well  as  beginning  in  earnest  the  conversion 
of  the  verified  holdings  records  for  the  inventoried  portion  of  the 
collection.    The  Data  Processing  Clerk  records  the  ordering  and 
subscription  data  on  mimeographed  data  collection  forms,  which  are 
batched  and  sent  to  the  keypunch  operator.    Meanwhile,  holdings  in- 
formation is  keypunched  directly  from  shelflist  cards  in  their  catalog 
trays.    As  the  ordering  and  subscription  data  are  converted,  the  com- 
puter will  immediately  begin  to  flag  subscription  renewals,  and  the 
periodicals  list  required  by  the  state  for  bidding  purposes  will  be 
produced  in  August  1966  from  the  information  in  the  magnetic  tape 
file.    Our  approach  to  conversion  and  to  the  associated  experimental 
programming  and  debugging  is  that  each  mass  conversion  of  a  seg- 
ment of  the  Serials  Record  will  justify  its  own  existence  by  immedi- 
ate utility.    Long  before  complete  holdings  information  is  available  in 
machine  readable  form,  the  automated  serials  system  will  be  per- 
forming nearly  all  of  the  other  functions  outlined  previously. 
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Conclusion 


The  progress  made  in  applying  data  processing  techniques  to 
library  operations  in  less  than  two  years  is  frankly  astonishing.    One 
system  operates  beyond  expectations,  and  a  second  major  operation 
should  be  successfully  automated  within  the  next  year  or  two.    Several 
lessons  have  been  learned  in  the  course  of  the  last  two  years.    There 
is  no  such  thing  as  a  completely  accurate  record  where  serials  are 
concerned.    Attempting  to  convert  data  for  a  computer  operation  tends 
to  spotlight  any  weakness  in  accurate  recording  of  information  and, 
for  that  matter,  in  the  procedures  which  supposedly  insure  that  this 
accuracy  is  maintained. 

There  is  an  old  axiom  that  a  computer  can  best  be  considered 
as  exceedingly  stupid,  but  it  can  be  letter-perfect  where  repetitive 
filing  and  record  maintenance  are  concerned.    We  are  convinced  that 
perfection  is  a  gift  with  which  mere  humans  are  not  yet  ready  to 
cope;  the  perfect  err  or -free  input  called  for  cannot  be  presented,  and 
consequently  we  spend  quite  a  bit  of  time  correcting  mistakes  that  we 
created  ourselves  only  the  run  before.    We  have  also  discovered  the 
great  need  for  rethinking  what  must  be  done.    It  is  easy  to  mechanize 
existing  processes,  without  reviewing  the  whys  of  the  operation  or, 
in  many  cases,  its  usefulness  in  the  first  place. 

Finally,  we  believe  that  our  progress  can  be  substantially  in- 
creased if  the  library  staff  is  given  instruction  in  basic  punched  card 
and  computer  concepts  and  then  kept  abreast  of  the  immediate  as  well 
as  overall  objectives.    To  accomplish  this,  a  continuing  program  of 
lecture  and  demonstration  meetings  has  been  established  in  which  the 
professional  librarians  and  most  of  the  clerical  employees  partici- 
pate.   They  are  encouraged  to  read  new  publications  of  potential  inter- 
est.   In  the  past,  representatives  of  other  libraries  have  also  been 
invited  to  attend  the  two  major  staff  seminar  meetings. 
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Figure  2 
Examples  of  the  Three  Punched  Cards  Used  in  the  Automated  Circulation  System 
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^) 

OUR  RECORDS  INDICATE  THAT  THE 

0 

CJ- 

FOLLOWING  BOOK  IS  OVERDUE  AT  TEXAS 

?rlfc 

A+M  UNIVERSITY  MAIN  LIBRARY. 

U 

PLEASE  PRESENT  THIS  NOTICE  WHEN 

RETURNING  BOOK. 

0 

CALL  NUMBER 

0 

o 

832  G599FH  

0 

o 

ITEM  NO             DATE  DUE                      6462846350 

0 

247710             12/01/65                     MISS  JEAN  M  MARTIN 

210  FAIRWAY 

o 

BRYAN   TEXAS 

o 

INOT  RETURNED  AS  OF  12/08/651 

o 

o 

_/•»- 

o 

OUR  RECORDS  INDICATE  THAT  THE 
FOLLOWING  BOOK  WAS  RETURNED    9  DAYS 
OVERDUE  AT  THE  TEXAS  A  +  M  UNIVERSITY  MAIN 

o— 
0 

LIBRARY.  YOUR  DELINQUENT  FINE  IS  $2.25. 

o 

CALL  NUMBER 

o 

o 

ITEM  NO      DATE  DUE      DATE  RETD 

o 

o 

21177.        11/29/65       12/08/65 

o 

0 

PLEASE  CONTACT  THE  MAIN  LIBRARY                   638202 
CIRCULATION  DESK  IN  ORDER  THAT  WE 

o 

••• 

MAY  RESOLVE  THIS  MATTER  TO  THE                       MR  RICHARD  A  JOHNSON 

0 

SATISFACTION  OF  ALL  CONCERNED.                     2801  N  HWY  6 
UNPAID  FINES  ARE  BILLED  THROUGH                 BRYAN   TEXAS 

o 

THE  FISCAL  OFFICE  FOR  COLLECTION. 

0 

.0. 

0 

OUR  RECORDS  INDICATE  THAT  THE 
FOLLOWING  BOOK  WAS  RETURNED    9  DAYS                          •»•  FILE  COPY  ••• 

o 

OVERDUE  AT  THE  TEXAS  A+M  UNIVERSITY  MAIN 

LIBRARY.  YOUR  DELINQUENT  FINE  IS  $2.25. 

o 

o 

CALL  NUMBER 

0 

949.5....  ...F688...  

o 

0 

ITEM  NO      DATE  DUE      DATE  RETD 

0 

21177.        11/29/65       12/08/65 

0 

PLEASE  CONTACT  THE  MAIN  LIBRARY                   63820? 

0 

CIRCULATION  DESK  IN  ORDER  THAT  WE 

0 

MAY  RESOLVF  THIS  MATTER  TO  THE                      MR  RICHARD  A  JOHNSON 
SATISFACTION  OF  ALL  CONCERNED.                     2801  N  HWY  6 

0 

UNPAID  FINES  ARE  BILLED  THROUGH                  BRYAN   TEXAS 

0 

THE  FISCAL  OFFICE  FOR  COLLECTION. 

o 

Figure  7 
Sample  Page  of  Overdue  Notices  Prepared  by  the  Computer 
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0*0      OONTKOL 

OESSING    CENTER 
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•                              '.      '                                    NEWSIN 
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1                1 
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1                          4                           •                           •                          T                           • 
•  •fO*OIIS4IOT090ltt4MrO»OII»4MTOtOIIS4MT900lt»4*«T090 

KPVWOT*H«I  1 

(2)  -ORDER                                                   QJ     ujjjy 
-CLASS                                                       UNIT 
-COPIES 
-LANGUAGE                                                    UNOTE 
-PUBLISHED 
-SOURCE                                                      (M  BOUND 
-TYPE                                                      Bnriim 

Required          .ADDED 
.ENTRT 
.KIND 
.LOCATE 

Optional            .ARRNOTE 
.BEGIN 
.BIND 
.CEASED 

-ADBIILS 
-BIDNOTE                                                      BNOTE 
-DOCUMENT 

"tUJSLr                                                     &     HISTORY 

ST 

-ORDERED                                                    /t)     ,,,-,. 

~PAYOK                                                         LACK 
-PREVIOUS 

-PRICE                                                         jjj 
-RENEW 

^JOTE                                                         (7)     XRBEER 
XREFER 

.CURRENT 
.CUTTER 
.DEW  El 
.FREQUENCT 
.NUM/VOL 

.SUPPL 
.TITLE 
.TPI 

•NOTE 

REVISION  DATE 

Control             .ACTIVE 
.OPT 
.  CONTINUED 
.NOUNITS 

-NEWBID 

Figure  8 
Format  for  keypunching  the  Serials  Record  Input 
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Keypwtched    -  D.te 
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Figure  9 
Form  for  Preparing  Serials  Record  Input 
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107840NEWSIN 
107840ADO      XREFER 

I08350NEWS.IN_ 
108350AOD     1XREFER 
108350ADD     2XREFER 

238350NEWSIN 
238350ADD      XREFER 

784350NEWSIN~~ 
784350ADD     1XREFER 
784350ADD     2XREFER 

865450NEWSIN 
JB6.5450ADD.  .   .  XRJEF-ER. 


AMERICAN  MANAGEMENT  REVIEW.  SEE  MANAGEMENT  REVIEW. $ 


ASSOCIATION  OF  CORPORATICN  TRAINING.  BULLETIN.  SEE 
MANAGEMENT  REVIEW. S 


CORPORATION  TRAINING.  SEE  MANAGEMENT  REVIEW.* 


NATIONAL  ASSOCIATION  OF  CORPORATION  SCHOOLS.  BULLETIN. 
SEE  MANAGEMENT  REVIEW.* 


.  .PERSONNEL... ADMINISTRATION.  SEE  MANAGEMENT  REVIEW.* 


778300NEWSIN 
778300ADD      .ACTIVE    t 
778300ADD      .ADDED     JAN65* 
778300AOD      .BEGIN     JAN* 
77~8300AOD   "  .fl'lND   '~  O'OIOOOOOOOOO* 
778300ADO      .CURRENT   V54N1Y1965* 
778300ADD      .CUTTER    M266* 
778300AOD      .DEWEY     650.5* 
778300AOD      .ENTRY     MANAGEMENT  REVIEW.* 
778300ADD  •     .FREQUENCYREG1  11  1  11111111* 

, 
oooooooooooo 

oooooooooooo 
oooooooooooo 

oooooooooooo 

.KINO 
.LOCATE 
.NUM/VOL 
.NUM/YR 
.TPI 
-CLASS 
-COPItS 
-FUND 
-LANGUAGE 
-MEMBER 
-ORDERED 
-PAYOK 

ORIGINAL* 
H-SS*      •  

12* 
SEHDEC* 
PERIODICAL* 

2> 
14120-5003* 
ENGLISH*         "7." 
AMERICAN  MANAGEMENT  ASSOCIATION* 
09/03/64  J-01965,V54t 
01/05/65*  . 

778300ADD 
778300ADD 
778300ADD 
778300ADO 
778300ADD 
778300ADD 

-PREVIOUS  25.00* 
-PRICE     25.00* 
-PUBLISHEDUNITED  STATES* 
-RENEW     AUG* 
-SOURCE    BID  LIST* 
-TYPE      RENEWAL* 

778300ADO 
778300ADD 
778300ADD 
778300ADD 
778300ADD 
778300ADD 

BOUND 
BOUND 
BOUND 
BOUND 
BOUND 
BOUND 

25831V1.1914* 
25832V2.1915* 

25833V3il916t 

154352V5t  1918* 
154353V6.1919* 

778300AOD 
778300ADD 
r778300ADD" 
778300ADD 
778300ADD 
778300ADD 

BOUND 
BOUND 
BOUND 
BOUND 
BOUND 
BOUND 

154230V7.1920* 
154231V8.1921* 

375602  V9-  12  i  1922-23* 
243104V13rl924S 
243105V15,1926t 
243106V18.1929* 

778300AOD 
778300AOD 
778300AOO" 
778300AOO 

BOUND 
BOUND 
"  BOUND 
BOUND 

243107V20tl931* 
154354V21tl932* 

154355V22t  1933* 
tS4356V23,1934* 

Figure  10 
Sample  Page  of  Entries  in  the  Serial  Record 
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BOUND  HOLDINGS 


778300    MANAG 


650.5 
M266 


HISTORY 


BOUND 


MANAGEHENT  REVIEW. 


V1-7N8.1914-AUG1920  AS  NATIONAL  ASSOCIATION  OF 

CORPORATION  SCHOOLS. BULLET  IN.  V 7N9-V8 t SEP1920-JAN1922  AS 
NATIONAL  ASSOCIATION  OF  CORPORATION  TRAINING. BULLETIN. 
V9.FEB-APR1922  AS  CORPORATION  TRAINING.  V10-U .MAY1922- 
MAR1923  AS  PERSONNEL  ADMINISTRATION.  V  1  ?- 1  <• ,  APR  1  923-Of.C 
1925  AS  AMLKICAN  MANAGEMENT  RLVIEH. 


25831 
25832 
25833 
25834 
154352 
154353 
154230 
154231 
375602 
243104 
243105 
243106 
243107 
154354 
154355 
154356 
154357 
176761 
295888 
154358 
154359 


Vl,1914 

V2.1915 

V3.1916 

V4.1917 

V5.1918 

V6.1919 

V7.1920 

V8tl921 

V9-12, 1922-23 

V13tl924 

V15.1926 

V18. 1929 

V20.1931 

V21.1932 

V22.1933 

V23.1934 

V24.1935 

V25.1936 

V26.1937 

V27.1938 

V28t1939 


154360 
295913 
295912 
176760 
243143 
295911 
225311 
216622 
216623 
243108 
224965 
224966 
243109 
344483 
341556 
341557 
304755 
341558 
341559 
341560 
502564 
586431 


V29, 1940 
V30.1941 
V32.1943 
V33.1944 
V34.1945 
V35.1946 
V36.1947 
V37, 1948 
V38.1949 
V39,1950 
V40.1951 
V41.1952 
V42.1953 
V43.1954 
V44.195S 
V45.1956 
V46.1957 
V47.1958 
V48.1959 
V49.1960 
V50.1961 
V51(1962 


Figure  11 
Sample  Page  of  the  Bound  Holdings  File 


Figure  12 

Examples  of  Arrival  Cards  for  Two  Copies  of  a  Serial,  with  Separate 

Volume  Indexes 
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TEXAS  A  +  M  UNIVERSITY  LIBRARY 

FEED 

PERIODICAL  LIST 
02/14/66 

636.  505/H361 

FEED  AND  FARM  SUPPLIER 

630 

636.08505/F295 
636.08505/F2957 

FEED  BAG 
FEEDS  ILLUSTRATED 

630 
630 

658.93305/F295 

FEEOSTUFFS 
FELSMECHANIK  UND  INGENI  EURGEOLOGIE 

650 
EL 

631.605/F411 
616.6905/F411 

FERTILISER  NEWS 
FERTILITY  AND  STERILITY 

630 
610 

665.05/F421 

FERTILITY  AND  STERILITY 
FETTE-SEIFEN-ANSTR1CHMITTEL 

VET 
660 

512.8105/F443 
677.05/F443 

FIBONACCI  QUARTERLY 
FIBRE  AND  FABRIC 

510 
670 

799.05/F453 

FICHERO  BIBLIOGRAFICO  HISPANOAMERICANO 
FIELD  AND  STREAM 

REF 
700 

633.205/C734F 
791.4305/F487 

FIELD  CROP  ABSTRACTS 
FILMFACTS 

REF 
REF 

332.6305/A532 

FINANCIAL  ANALYSTS  JOURNAL 
FINANCIAL  EXECUTIVE 

330 
BA 

332.05/F491 

FINANCIAL  WORLD 
FINANCIAL  WORLD 

330 
BA 

FINANCING  HIGHER  EDUCATION 
FINISHING  ABSTRACTS 

370 
EL 

551.4606/F511M 

FINLAND.   HAVSFORSKNINGSINSTITUT. 
MERENTUTKIMUSLAITOKSEN 

FIRE  ENGINEERING 
FIRE  RESEARCH  ABSTRACTS  AND  REVIEWS 

EL 
EL 

812.05/F527 
639.205/F537 

FIRST  STAGE  -  A  QUARTERLY  OF  NEW  DRAMA 
FISHERIES  NEWSLETTER 

800 
630 

664.905/F596 

FLEET  OWNER 
DIE  FLEISCHWIRTSCHAFT 

EL 
660 

FLIGHT  INTERNATIONAL 
FLIGHT  MAGAZINE 

EL 
EL 

580.5/F632 
378.759/F63S 

FLORA 
FLORIDA.   STATE  UNIVERSITY.  TALLAHASSEE.   RESEARCH 

580 

506/F636J 

COUNCIL.   FLORIDA  STATE  UNIVERSITY  STUDIES 
FLORIDA  ACADEMY  OF  SCIENCES.   QUARTERLY  JOURNAL 

500 

595.705/F636 

FLORIDA  ENGINEERING  SOCIETY.   JOURNAL 
FLORIDA  ENTOMOLOGIST 

EL 
590 

630.5/F63B 
570.8/F6368 

FLORIDA  GROWER  AND  RANCHER 
FLORIDA.   UNIVERSITY,  GAINESVILLE.   STATE  MUSEUM. 

630 
570 

639.105/F636 

BULLETIN 
FLORIDA  WILDLIFE 

630 

635.905/F644 

FLOW  LINE 
FLOWER  GROWER 

EL 
630 

FLUID  HANDLING 
FLYING 

EL 
EL 

050/F6S2 

FOCUS  (AMERICAN  GEOGRAPHICAL  SOCIETY) 
FOCUS  (CONFERENCE  BOARD) 

050 
BA 

574.05/F665 
612.105/F665 

FOLIA  BIOLOGICA 
FOLIA  HAEMATOLOGICA 

570 
610 

398.05/F666 
398.05/F666F 

FOLK-LORE  (FOLKLORE  SOCIETY,  LONDON) 
FOLK-LORE  (INDIAN  PUBLICATIONS) 

390 
390 

784.405/F666 
398.05/F666J 

FOLKLORE  AND  FOLK  MUSIC  ARCHIVIST 
FOLKLORE  INSTITUTE.   JOURNAL 

700 
390 

338.105/F636 
338.10611/U5755D 

FOOD  AND  AGRICULTURAL  LEGISLATION 
FOOD  AND   AGRICULTURE  ORGANIZATION  OF  THE   UNITED 

330 

338.10611/U5755CS 

NATIONS.   FAO  AGRICULTURE  DEVELOPMENT  PAPER 
.   FAO  COMMODITY  BULLETIN  SERIES 

Figure  13 
Sample  Page  of  the  Current  Periodicals  Subscription  List 


COMPUTER  APPLICATIONS  TO 
BOOK  CATALOGS  AND  LIBRARY  SYSTEMS 


Donald  H.  Stromberg 


Librarians  must  look  to  the  future  information  needs  of  a  coun- 
try expanding  in  population,  technology  and  educational  requirements. 
The  "information  explosion"  is  placing  an  additional  strain  on  exist- 
ing methods  of  providing  information  rapidly  and  economically. 

A  library  seeking  to  develop  a  modern  information  retrieval 
program  has  many  existing  services  from  which  to  choose.    The  prob- 
lem is  to  define  the  program  that  will  best  serve  the  present  library 
users  and  leave  room  for  flexible  action  in  the  future,  and  then  to 
pick  a  combination  of  services  that  best  match  these  objectives. 

Documentation  Incorporated  (Doc  Inc)  of  Bethesda,  Maryland, 
was  founded  in  1952  by  the  late  Dr.  Mortimer  Taube,  and  has  been 
engaged  in  developing  modern  information  retrieval  systems  for 
government  and  industry.    A  key  concept  that  is  now  emerging  is  the 
development  of  mechanized  or  computerized  data  banks.    This  data 
bank  concept  is  a  plan  for  organizing  a  single  set  of  data  for  produc- 
ing many  products.    The  traditional  library  card  catalog  or  data  bank, 
long  the  key  to  finding  materials  in  the  nation's  libraries,  today  is 
getting  competition  from  book  catalogs.    Using  data  bank  techniques 
to  keep  a  library  catalog  updated,  Doc  Inc  computer  systems  generate 
printouts  of  the  catalog  which  are  used  to  produce  bound  books  for 
distribution  to  library  users.    In  effect,  the  book  catalogs  are  carry- 
ing the  traditional  card  catalog,  literally,  into  the  homes  and  offices 
of  users  instead  of  requiring  them  to  trek  to  the  library  to  find  out 
if  the  information  they  want  is  available.    The  computer  is  used  to 
produce  several  indexes  (such  as  subject,  author,  and  title)  in  vari- 
ous formats  from  a  single  file  of  data  and  is  particularly  effective  if 
the  catalog  data  bank  is  standardized. 

Despite  the  wide  variety  of  ways  libraries  catalog  books,  most 
differences  are  variations  of  the  Library  of  Congress  cataloging  sys- 
tem.   By  using  the  LC  catalog  system  as  a  basis  or  standard  for 


Donald  H.  Stromberg  is  Director  of  the  Publication  System  Division 
of  Documentation  Incorporated,  Bethesda,  Maryland. 


195 


196 

building  the  data  bank,  most  libraries  can  be  provided  with  just  the 
index  information  they  need  by  selecting  only  those  items  that  apply 
from  the  LC  data  bank.    An  LC  data  bank  system  can  produce  book 
catalogs  for  many  libraries  from  the  same  data,  with  provisions  for 
handling  special  information. 

Having  introduced  the  general  concept  of  Documentation  Incorpo- 
rated, the  following  questions  should  be  answered:    (1)  Why  book  cata- 
logs?   (2)  Why  use  a  computer  to  produce  book  catalogs?    (3)  Why 
develop  a  catalog  data  bank?    Each  of  these  questions  is  inter- 
related in  terms  of  cost,  standards,  service,  and  advancements  in 
technology. 


(1)  Why  Book  Catalogs? 


There  are  advantages  and  disadvantages  to  both  book  catalogs 
and  card  catalogs.    However,  it  is  interesting  to  note  that  the  card 
catalog  for  all  practical  purposes  is  a  little  more  than  seventy  years 
old  and  was  preceded  by  book  catalogs.    In  a  paper,  "Book  Catalogs 
Versus  Card  Catalogs,"  by  Irwin  H.  Pizer,  the  present  trend  back  to 
the  book  catalog  made  possible  by  recent  advances  in  computer  tech- 
nology is  analyzed  and  advantages  and  disadvantages  are  compared.  1 
To  quote  a  final  statement,  "We  have  gained  most  of  the  advantages, 
have  minimized  a  few  of  the  disadvantages,  and,  indeed,  can  look  for- 
ward like  Dr.  Pangloss  to  a  computer  produced  catalog  which  will  be 
the  best  possible  one,  in  this  best  of  all  possible  worlds." 

Book  catalogs  can  cut  costs  by  eliminating  the  purchase  or  pro- 
duction of  cards  for  multiple  locations,  the  expense  of  card  file  main- 
tenance and  card  cases,  and  the  freeing  of  prime  space  now  occupied 
by  bulky  card  catalog  cases.    They  offer  more  service  by  providing 
complete  holdings  catalogs  in  bookmobiles;  cataloging  and  order  sec- 
tions; local,  school,  branch,  and  cooperating  libraries;  reference 
desks;  university  schools,  departments,  and  residences;  and  offices 
of  researchers.    They  can  save  staff  time  by  eliminating  card  catalog 
maintenance  activities  such  as  typing  cards  and  headings,  sorting, 
filing,  reviewing,  and  replacing.    Book  catalogs  are  as  easy  to  use  as 
a  telephone  directory— users  are  generally  more  familiar  with  refer- 
ence books  than  with  card  files.    Page  scanning  allows  faster  location 
of  entries  with  similar  entries  and  headings,  quickly  compared  at  a 
glance. 

Finally,  with  advancements  in  computer  technology,  the  quality 
of  the  computer  final  product  has  been  greatly  improved.    The  first 
advancement  was  the  introduction  of  both  upper  and  lower  case  char- 
acters directly  printable  from  the  computer.    The  second  is  the  de- 
velopment of  photocomposition  equipment  which  is  capable  of  taking 
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computer  generated  tapes  and  composing,  with  graphic  arts  quality, 
the  book  catalog  entries.    The  quality  of  the  computer  produced  cata- 
log, which  in  the  past  has  concerned  many  librarians,  has  greatly 
improved. 


(2)  Why  Use  a  Computer  to  Produce  Book  Catalogs  ? 


Computers  are  high  speed  data  processors,  ideally  suited  for 
performing  many  repetitive  clerical  tasks,  providing  that  the  de- 
cisions required  can  be  limited  and  explained.    The  scope  of  the  re- 
quired decisions  and  the  volume  of  the  data  to  be  processed  are  key 
factors  in  deciding  what  data  processing  equipment  is  to  be  used.    It 
is  important  to  realize  that  there  is  a  wide  range  of  computers  with 
varying  limitations— not  the  least  of  which  is  the  craftsman  or  com- 
puter specialist  who  puts  this  tool  to  work.    Electronic  Accounting 
Machines  (EAM)  card  processors  are  quite  limited  and  should  not  be 
confused  with  the  memory  stored-program  computers  that  utilize  tape 
and  disc  storage  devices  as  well  as  punched  cards.    Card  processors 
have  inherent  limitations  when  scanning,  compressing,  selecting,  and 
sorting  data  which  are  required  to  meet  the  challenge  of  the  filing 
rules  and  variations  of  catalog  information. 

Computers  are  used  to  produce  book  catalogs  because  they  offer 
cost  savings,  flexibility,  consistency,  and  selected  retrieval.    The 
converse  of  this  statement  is  also  true  when  the  tool  is  applied  im- 
properly.   An  experienced  computer  specialist  would  not  use  this  tool 
solely  as  a  reproducing  device.    Similarly,  a  librarian  would  not  send 
a  catalog  card  to  a  printing  firm  to  produce  one  copy.    The  profes- 
sional should  know  his  tools  and  when  and  how  to  use  them.    A  com- 
puter pays  off  only  when  it  produces  the  right  information  on  demand, 
but  what  comes  out  depends  not  only  on  what  is  put  in,  but  on  how  it 
is  done. 


(3)  Why  Develop  A  Catalog  Data  Bank? 


The  concept  of  developing  a  catalog  data  bank  is  a  key  factor  in 
utilizing  the  computer  to  produce  book  catalogs.    Cost  savings  and 
standardization  of  catalog  information  are  the  major  benefits  derived 
from  this  concept.    The  Council  on  Library  Resources  has  granted 
$130,000  to  the  Library  of  Congress  for  a  pilot  program  of  distribut- 
ing library  catalog  data  in  machine  readable  form.    Under  this  grant 
the  Library  of  Congress  will  also  study  the  value  and  feasibility  of 
providing  this  service  to  libraries  on  a  broad  scale  and  a  continuing 
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basis.  Successful  completion  of  this  project  can  reduce  even  further 
the  cost  of  computer  generated  book  catalogs,  if  the  primary  content 
of  the  data  bank  is  standardized  to  the  LC  cataloging. 

In  adopting  a  cataloging  standard  such  as  LC,  a  library  can  also 
experience  a  substantial  reduction  in  individual  cataloging  require- 
ments.   This  is  not  to  say  that  cataloging  will  not  be  required  in  the 
future,  but  since  many  libraries  utilize  LC  catalog  cards  in  whole  or 
in  part,  cataloging  can  be  limited  to  those  titles  that  have  not  been 
processed  by  the  Library  of  Congress. 

Professional  librarians  should  have  little  fear  of  automation 
creating  unemployment.    To  quote  from  the  "National  Inventory  of 
Library  Needs,"  by  Edwin  Castagna,  concerning  the  professional 
staff,  "The  national  shortage  in  professionally  trained  staff  to  meet 
ALA  standards  in  these  libraries  would  be  around  100,000.    The  data 
on  present  staff  are  for  'professional  positions  filled'  and  not  pro- 
fessionally trained  librarians.  "2    In  reviewing  the  various  reports  in- 
cluded in  the  National  Inventory  of  Library  Needs,  it  is  apparent  from 
the  deficiencies  highlighted  that  there  is  good  reason  to  limit  original 
cataloging  to  those  items  not  cataloged  by  LC  if  it  does  not  greatly 
aid  the  use  of  the  library  collection. 


Using  a  Computer  to  Produce  Book  Catalogs 


This  section  explains  the  process  used  to  produce  a  book  cata- 
log from  Documentation  Incorporated 's  computer  equipment  consist- 
ing of  an  IBM  1410  and  1401  tape  system.    Figure  1  illustrates  the 
general  systems  flow  for  computer  production  of  book  catalogs. 
Figures  2-7  are  actual  samples  derived  from  the  Baltimore  County 
Public  Library  book  catalog  project. 

Each  step  in  the  general  systems  flow  is  described  briefly  as 
follows: 

The  Catalog  Card.    The  catalog  card,  which  is  normally  placed 
in  the  card  drawer,  is  the  primary  source  of  input  for  computer  pro- 
cessing.   This  is  particularly  true  for  those  titles  that  have  already 
been  cataloged  or  are  processed  by  a  national  organization  such  as 
the  Library  of  Congress.    When  the  individual  library  system  per- 
forms the  cataloging,  a  standard  input  form  should  be  used  for  re- 
cording the  pertinent  data.    The  use  of  a  standard  form  with  pre- 
printed identification  of  each  data  element  to  be  included  in  the 
computer  file  eliminates  the  need  for  additional  machine  coding  and 
can  be  used  in  the  ordering  process. 

To  secure  a  book  catalog,  a  library  must  provide  an  identifi- 
cation number  for  those  titles  contained  in  or  added  to  its  collection. 
The  identifying  number  for  titles  cataloged  by  LC  is  the  LC  card 
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number  and  is  used  by  the  individual  library  to  call  the  title  from  the 
data  bank  for  the  book  catalog.    Those  titles  not  cataloged  by  LC  re- 
quire an  identification  number  and  processing  using  the  standard  input 
form. 

Coding,  Editing,  and  Proofing.    The  first  function  in  the  process- 
ing is  that  of  coding  and  editing  the  catalog  information  for  machine 
preparation  and  is  only  necessary  for  those  titles  not  already  included 
in  the  data  bank  master  file.    A  typical  BCPL  entry  that  has  been 
coded  and  edited  is  illustrated  in  Figure  2.    The  handwritten  marks 
or  "tags"  identify  each  element  of  catalog  information  and  are  used 
throughout  the  computer  system  for  selecting,  formatting,  and  gener- 
ating field  descriptions.    The  standard  input  form,  used  by  a  library 
to  catalog  titles  that  are  unique  to  its  collection,  should  contain  the 
tag  identifiers. 

Machine  Preparation.    The  machine  preparation  function  con- 
verts the  written  information  into  machine  language.    An  80-column 
Hollerith  (IBM)  card  is  the  most  popular  form  of  computer  input.    In 
preparing  the  catalog  entry  for  the  computer  system,  each  tagged 
item  is  key  punched  into  a  card.    If  the  information  for  an  item  ex- 
ceeds the  card  limit  of  80  columns,  a  second  or  third  follow-on  card 
is  prepared.    A  special  code  is  punched  to  indicate  that  the  following 
character  is  to  be  capitalized  when  required. 

The  conversion  of  written  information  into  machine  language 
may  also  be  accomplished  effectively  using  punched  paper  tape  or 
optical  scanning  equipment.    The  state  of  the  art  of  optical  scanning 
still  requires  rigid  control  of  the  source  document  format  and  char- 
acter style. 

Computer  Edit  and  Update.    The  computer  edit  and  update  phase 
of  the  processing  system  performs  the  file  maintenance  and  is  sepa- 
rate from  the  actual  generation  of  the  book  catalog.    Emphasis  is 
placed  on  maintaining  an  accurate  data  bank  file.    The  five  basic 
steps  of  this  phase  follow: 

a.  Read  the  punched  cards  and  create  a  magnetic  tape  of  the 
card  information. 

b.  Sort  the  magnetic  tape  into  ascending  order. 

c.  Edit  the  information  to  determine  if  the  tags  and  machine 
codes  are  properly  encoded. 

d.  Update  the  existing  master  magnetic  tape  file  with  the  new 
information. 

e.  Print  the  proof  listing  of  all  changes  to  the  file  in  upper  and 
lower  case. 

The  IBM  1401  performs  steps  (a)  and  (e)  and  the  IBM  1410  per- 
forms steps  (b)-(d).    A  new  magnetic  tape  file  is  created  during  the 
updating  process  and  reflects  the  latest  version  of  the  data  bank  file. 
This  tape  is  used  as  input  for  the  next  cycle. 
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A  sample  of  the  proof  listing  from  step  (e)  is  illustrated  in 
Figure  3.    Errors  in  coding  and  machine  preparation  are  corrected 
on  the  proof  listing,  and  the  file  maintenance  cycle  is  repeated.    The 
proof  listing  contains  printed  information  that  is  not  recorded  on  the 
original  coded  entry.    This  information  is  computer  generated  by 
identifying  the  tags  and  is  used  to  facilitate  proofing  and  proper  fil- 
ing.   In  comparing  subject  terms  on  the  proof  listing,  tag  72001  does 
not  contain  the  punctuation  and  capitalization  that  appear  in  tag  72002. 
This  is  a  computer  generated  sort  field  and  can  be  changed  in  the  next 
update  cycle  if  it  does  not  conform  to  the  required  filing  rules.    Sort 
fields  are  generated  for  each  major  heading  and  are  a  very  important 
feature  of  this  system. 

Computer  Generated  Catalogs.    Once  a  corrected  master  file 
has  been  created,  the  generation  of  the  book  catalog  becomes  a  me- 
chanical process  performed  in  the  four  steps  that  follow: 

a.  Select  from  the  master  file  the  information  required  for 
each  of  the  indexes. 

b.  Sort  the  major  headings  for  each  index. 

c.  Format  for  printing. 

d.  Print  the  indexes. 

It  is  the  decision  of  the  individual  library  system  that  deter- 
mines what  indexes  are  necessary  and  the  contents  of  each  index  en- 
try.   Figures  4-7  are  reproductions  of  the  actual  full  page  IBM  1401 
printout  for  the  subject,  author,  and  title  indexes  requested  by  the 
Baltimore  County  Public  Library.    Variations  of  the  original  entry 
(see  Figure  2)  are  illustrated  in  each  of  the  indexes.    Examination 
will  also  reveal  the  series  statement  as  a  major  heading  in  the  title 
index  (see  Figure  7). 

It  is  important  to  reiterate  the  flexibility  that  is  available  in 
applying  computer  systems  to  producing  book  catalogs.    For  example, 
if  a  library  decides  that  the  next  publication  of  the  book  catalog  should 
not  have  series  statements  as  a  major  heading  in  the  title  index, 
there  is  no  need  to  change  any  of  the  entries  that  have  been  incorpo- 
rated in  the  master  file.    This  is  automatically  accomplished  by  in- 
structions to  the  computer  selection  program. 

Overview  Print  and  Bind.    The  overview  phase  of  this  process 
is  to  insure  the  accuracy  of  the  final  product.    It  is  not  necessary  to 
cut,  paste,  or  strip  the  computer  print-out  prior  to  making  a  plate  for 
the  printing.    In  other  words,  the  computer  print- out  is  the  camera 
ready  copy.    Printing  and  binding  of  the  final  product  is  the  same 
process  as  for  other  types  of  publications. 

LC  Entries  and  Photocomposition.    The  concept  employed  to 
produce  book  catalogs  for  the  Baltimore  County  Public  Library  ap- 
plied to  producing  the  sample  LC  entries  in  Figure  8.    The  contents 
of  the  printed  entry  in  each  of  the  indexes  are  subject  to  decisions  of 
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the  individual  library  system.    It  is,  however,  more  economical  to 
print  the  full  entry  in  one  index  and  abbreviated  versions  of  the  entry 
in  the  remaining  indexes. 

Photocomposition  enhances  the  appearance  and  reduces  the  num- 
ber of  pages  required  in  a  book  catalog.    Figure  9  illustrates  a  sam- 
ple photocomposed  BCPL  catalog  page.    Photocomposition  devices, 
which  can  be  driven  from  computer  generated  tapes,  offer  the  flexi- 
bility of  type  styles  and  graphic  arts  quality  available  in  typesetting. 
This  process  can  be  justified  when  the  number  of  pages  and  copies  is 
substantial,  or  the  publication  is  semi-permanent  such  as  an  annual 
or  biennial  cumulative  book  catalog. 

Looking  Forward 

There  are  many  advantages  to  automating  library  systems  such 
as  simplifying  the  ordering  and  lending  processes  using  computer 
techniques.    Certainly  computers  can  improve  reference  services  by 
making  it  possible  to  provide  demand  bibliographies.    However,  most 
of  us  are  looking  to  exciting  future  applications  of  computers  to  infor- 
mation processing.    Wouldn't  it  be  nice  to  push  a  few  buttons  on  the 
touch-tone  telephone  that  would  bring  a  favorite  novel  beaming  through 
your  television  set  at  home?    Impractical?    Today,  the  answer  is  yes, 
but  look  at  the  developments  over  the  past  several  decades— television 
and  video  tapes,  computers  linked  by  telecommunications,  touch-tone 
dialing  systems,  and  last  but  not  least,  the  catalog  card  in  machine 
or  computer  form. 

One  major  step  forward  is  to  have  a  convenient  display  of  the 
contents  of  the  library  collections  at  the  user's  disposal  and  the  li- 
brary collection  in  a  machine  form  ready  to  respond  to  inquiries.  The 
key  to  matching  the  user's  need  for  knowledge  about  the  collection 
and  the  mechanics  of  the  system  is  the  published  catalog.    Advance- 
ments in  technology  will  make  this  service  economical  on  a  wide 
basis,  but  there  is  a  real  need  for  libraries  to  prepare  for  the  future; 
publications  and  educational  programs  will  be  required  for  the  users 
in  addition  to  a  conversion  of  the  physical  form  of  the  collection  into 
a  computerized  data  bank. 
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Figure  7 
Series  Entry  in  Title  Catalog 
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TA7.045  64-35744 
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Figure  8 
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LIBRARY  APPLICATIONS  OF  DATA 
PROCESSING:    AN  ENUMERATIVE  BIBLIOGRAPHY,  1964-65 


Compiled  by  James  Kr  ike  las 


The  purpose  of  this  bibliography,  prepared  for  the  participants 
of  the  1966  Clinic,  is  to  provide  a  convenient  list  of  references  to 
actual  applications  of  data  processing  machines  for  the  mechanization 
of  library  routines.    Items  published  in  1964  that  were  included  in  a 
similar  list  distributed  in  1965  have  been  excluded.*   In  addition  to 
Library  Literature  and  Library  Science  Abstracts,  a  new  quarterly, 
Documentation  Abstracts  (which  includes  the  former  "Literature 
Notes"  section  of  American  Documentation),  should  be  useful  for  cur- 
rent references.    For  retrospective  citations  to  operational  systems 
the  references  cited  below  may  be  useful.    (It  should  be  noted  that  the 
scope  of  each  is  not  identical  and  some  duplication  has  occurred.) 

Helen  E.  Loftus  and  Allen  Kent.    "Automation  in  the  Library— 
An  Annotated  Bibliography,"  American  Documentation,  7:110- 
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Edward  Mack  McCormick.    "Bibliography  of  Mechanized  Li- 
brary Processes."    In  Herbert  Goldhor,  ed.,  Proceedings  of  the 
1963  Clinic  on  Library  Applications  of  Data  Processing.    Cham- 
paign, 111.,  University  of  Illinois  Graduate  School  of  Library 
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Robert  S.  Taylor.    Bibliography  of  Recent  Reports  and  Books 
on  Library  Automation  and  Library  Systems  Analysis.    (Li- 
brary Systems  Analysis,  Report  No.  l)   Bethlehem,  Pa.,  Center 
for  the  Information  Sciences,  Lehigh  University,  1964. 


*James  Krikelas.    "Library  Applications  of  Data  Processing:  A 
Selected  Bibliography,  1963-64."    In.  Frances  B.  Jenkins,  ed.,  Pro- 
ceedings of  the  1965  Clinic  on  Library  Applications  of  Data  Process- 
ing.   Champaign,  111.,  University  of  Illinois  Graduate  School  of  Library 
Science,  1966,  pp.  193-201. 
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College  &  Research  Libraries,  26:311-315+,  July  1965. 
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by  Camera  and  Computer, "  Library  Resources  &  Technical  Services, 
9:205-206,  Spring  1965. 
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Serials 


25.  Brodman,  Estelle,  and  Moore,  Evelyn  A.    "Mechanized 
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Bureau  of  Ships,  1964. 

45.  U.  S.  George  C.  Marshall  Space  Flight  Center,  Project  & 
Industry  Applications  Division.    The  Alpha  System:    A  Progress  Re- 
port on  Phase  I  of  a  Plan  to  Automate  the  Operation  of  the  Redstone 
Scientific  Information  Center.    Huntsville,  Ala.,  Project  &  Industrial 
Applications  Division,  Computation  Division,  1964. 


Experimental  and  Proposed  Systems 


46.  Buckland,  Lawrence  F.    The  Recording  of  Library  of  Con- 
gress Bibliographical  Data  in  Machine  Form;  A  Report  Prepared  for 
the  Council  on  Library  Resources,  Inc.    Rev.    Washington,  Council  on 
Library  Resources,  Inc.,  1965. 

47.  Hayes,  Robert  M.    "The  Concept  of  an  On-Line,  Total  Li- 
brary System,"  Library  Technology  Reports,  May  1965. 

48.  Raizada,  A.  S.    "Automation  in  Documentation:    Union  Cata- 
logue Processing  by  Digital  Computer,"  Annals  of  Library  Science 
and  Documentation,  9:54-76,  Dec.  1964. 

49.  Snyder,  Samuel  S.    "Automation  at  LC:    Philosophy,  Plans, 
Progress,"  Library  Journal.  90:4709-14,  Nov.  1,  1965. 

50.  Stewart,  Bruce  Warren.    A  Computerized  Serials  Record 
System  for  the  Texas  A  &  M  University  Library.    College  Station, 
Texas  A  &  M  University  Library,  1965. 


Author  Index 
(Numbers  refer  to  bibliography  items.) 

Becker,  J 34  Cammack,  F.  M 16 

Boylan,  M.  N 9  Corbin,  J.  B 1 

Bregzis,  R.    35  Cox,  C.  C 2 

Brown,  J.  E 36 

Brodman,  E 25  De  Gennaro,  R 3 

Buckland,  L.  F 46 


217 

Dejarnett,  L.  R 17  Nicolaus,  J.  J 43 

Divett,  R.  T 38 

Drasnin,  R 26  Palmer,  F.  M 21 

Perrault,  J 13 

Enlow,  H 4  Plant,  K 22 

Felter,  J.  W 27 

Raizada,  A.  S 48 

Hammer,  D.  P 28  Robinson,  C.  W 14 

Hayes,  R.  M 47  Roy,  R.  H 23 

Jackson,  W.  C 7  Schultheiss,  L.  A 6,  31 

Jean,  G.  L 18  Shaw,  R.  R 24 

Jones,  R.  C 8  Snyder,  S.  S 49 

Kee,  W.  A 29  Stewart,  B.  W 50 

Kennedy,  J.  H 9,  19 

Kibbee,  J.  M 41  Tjoeng,  D.  S 27 

Kilgour,  F.  G 10,  11 

Koch,  M.  S 5  Vdovin,  G 32 

Lebowitz,  A.  I 29  Wilkinson,  J.  P 33 

Livingston,  F.  G 30  Wilkinson,  W.  A 15 

McCoy,  R.  E 20 

Meakin,  A.  0 12 

Moore,  E.  A 25,  42 


Library  &  Project  Index 


Association  of  Atlantic  Universities  (Canada) 33 

Atomic  Energy  Commission  Headquarters  Library 29 

Baltimore  County  Public  Library 14 

Barnet,  London  Borough  of,  Public  Library 12 

Canada.    Department  of  Agriculture  Library 18 

Canadian  National  Research  Council  Library,  National  Science 

Library  of  Canada  (Ottawa) 36 

Columbia- Harvard -Yale  Medical  Libraries  Project 10,  11 

Dalhousie  University  (Halifax,  N.  S.) 33 

Downstate  Medical  Center  (Brooklyn,  N.  Y.) 5 

Florida  Atlantic  University 13,  34 


218 

Harvard  University 3,  21 

Johns  Hopkins  University 23,  39,  40 

IBM  Advanced  Systems  Development  and  Research  Library.  ...     47 
India  Institute  of  Technology  (Kanpur)  Computer  Centre 48 

Lawrence  Radiation  Laboratory,  University  of  California 

(Livermore)      9,  19 

Los  Angeles  State  College  Library 41 

Medical  Library  Center  of  New  York 27 

Monsanto  Company  (St.  Louis)  Information  Center 15 

Nassau  (N.  Y.)  Library  System    37 

Ohio  Agricultural  Research  and  Development  Center  (Wooster) 

(formerly,  Ohio  Agricultural  Experimental  Station) 4 

Ontario  New  Universities  Library  Project 35 

Purdue  University 28 

Rand  Corporation 26 

Redstone  Scientific  Information  Center 45 

St.  Louis  Junior  College  District 8 

Southern  Illinois  University 17,  20 

Suffolk  (N.  Y.)  Cooperative  Library  System 37 

Texas  A  &  M  University 50 

Texas  State  Library 1 

Union  Catalog  of  Medical  Periodicals  "Project" 27 

U.  S.  Bureau  of  Ships  Technical  Library 43,  44 

U.  S.  Library  of  Congress 46,  49 

University  of  California,  San  Diego 32 

University  of  Colorado 7 

University  of  Hawaii    16,  24 

University  of  Illinois,  Chicago 6,  31 

University  of  Louisville  (Kornhauser  Memorial  Medical 

Library) 30 

University  of  Maryland 2 

University  of  New  Mexico  (Library  of  the  Medical  Sciences)  ...     38 
University  of  Toronto 35 

Washington  University  (St.  Louis)  School  of  Medicine  Library  .  25,  42 


UNIVERSE  OF  ILLINOS-URBANA 


30112071033317 


